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A general,  systematic,  theoretical  and  experimental  study  of  the 
ion-exchange  process  (IEP)  in  glass  is  presented.  The  study 
encompasses  characterization  of  the  graded -index  (GRIN)  waveguides 
fabricated  using  the  Ag+-Na+  IEP,  as  well  as  the  application  of  these 
waveguides  to  practical  devices  such  as  tapered  waveguide  transitions 
and  multiplexers  / demultiplexers  (MUX  / DEMUX)  using  symmetric 
directional  couplers.  The  IEP  as  well  as  GRIN  waveguides  and  the 
devices  have  been  modeled  and  excellent  agreement  has  been 
obtained  with  the  experimental  results. 

The  modeling  starts  with  the  simulation  of  the  IEP  in  glass.  The 
1-D  and  2-D  nonlinear  diffusion  equations  are  solved  by  a finite 
difference  method  (FDM)  to  obtain  the  index  profile  for  both  the 
single-step  and  two-step  diffusion  processes  with  and  without  an 
external  electrical  field.  The  modal  characteristics  of  the  waveguides 
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are  attained  by  solving  the  Helmholtz  equation  by  FDM.  The  beam 
propagation  method  (BPM),  used  to  simulate  the  wave  propagation  in 
devices,  is  also  illustrated. 

The  fabrication  and  characterization  of  low-loss,  single-mode 
planar  and  channel  waveguides  are  described  in  detail.  The  waveguides 
are  obtained  by  exchanging  Na+  in  the  BK7  glass  substrate  with  Ag+  in 
a molten  salt  bath  consisting  of  NaNC>3,  KNO3  and  AgNC>3.  The 
resulting  guides  are  characterized  in  terms  of  the  mode  indices, 
modal  fields  and  attenuations  at  1 .3  pm  wavelength. 

The  results  are  correlated  with  the  theory  and  are  further  used 
to  predict  the  index  profile  and  the  modal  field  distribution  of  tapered 
waveguides  obtained  by  postbaking  surface  channel  waveguides  under  a 
thermal  gradient.  The  radiation  losses  in  tapered  transitions  with 
various  taper  profiles  and  lengths  are  simulated  by  BPM.  Lossless 
transitions  are  attained  for  5 mm  long  tapers. 

Symmetric  directional  couplers  as  wavelength  MUX/DEMUX  are 
modeled,  fabricated,  and  tested.  The  coupling  length  of  the  couplers 
as  a function  of  wavelength  is  determined  by  a novel  technique 
employing  the  spectral  response  of  the  normalized,  measured  power 
outputs  of  the  two  guides.  MUX/DEMUX  operating  at  1.315  and  1.561 
pm  with  crosstalk  and  cross  power  ratio  of  at  least  -40  dB  are 
demonstrated.  A total  insertion  loss  of  1.25  dB  for  a 14.5  mm  long 
device  and  propagation  loss  as  low  as  0.15  dB/cm  have  been  achieved. 
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CHAPTER  1 
INTRODUCTION 


A new  era  of  optical  communication  systems  was  started  in 
1970s  by  the  achievement  of  low  propagation  loss  in  optical  fibers. 
Optical  communication  has  grown  from  its  infant  stage  in  the  early  70s 
into  one  of  the  major  industries  today,  thanks  to  a parallel 
development  in  the  associated  technologies  such  as  reliable 
semiconductor  lasers,  high  speed  modulators,  high  efficiency 
detectors  and  monolithic  integration  of  the  above  devices.  As  the 
systems  become  more  and  more  sophisticated,  the  criteria  for  cost, 
fiber  compatibility  and  performance  are  becoming  more  important. 
Therefore,  extensive  research  and  development  efforts  in  this  field 
have  continued  to  meet  the  stiff  requirement.  The  future  of  the  optical 
communication  indeed  is  extremely  bright. 

Optical  communication  system  consists  of  active  components, 
e.g.,  laser,  modulator,  etc.  as  well  as  passive  components,  e.g.,  filter, 
power  divider,  etc.  To  realize  these  passive  devices  and  satisfy  the 
criteria  mentioned  above,  glass  waveguides  fabricated  by  ion  exchange 
technique  have  recently  received  considerable  attention.  This  well- 
known  ion-exchange  technique  was  used  in  early  days  to  strengthen 
the  glasses  and  produce  tinted  glasses.  It  has  been  used  to  change  the 
refractive  index,  color,  strength  [Ohta75,  Fine88,  see  ref.  list  in 
Kara79]  etc.  in  glass.  Ion-exchanged  glass  waveguides  and  integrated 
optical  devices  on  glass  substrates  are  one  of  the  best  choices  since  1) 
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optical-quality  substrate  material  is  generally  inexpensive  compared  to 
its  competing  crystal  substrates  such  as  LiNbC>3  or  III-V  compound 
semiconductors;  2)  fabrication  is  quite  simple  and  can  be  carried  out 
at  a relatively  low  temperature  and,  thus,  the  cost  is  reduced;  3) 
refractive  index  of  host  glass  is  compatible  to  that  of  the  optical  fibers 
and  the  reflection  loss  at  the  fiber-device  interface  is  minimal;  and  4) 
high-performance  devices  with  low  propagation  loss  are  easily 
attainable. 

Optical  waveguides,  in  most  of  the  applications  in 
communication  systems,  are  two-dimensional  (2-D)  structures  that  are 
characterized  by  a region  of  higher  index  in  the  transverse 
dimensions,  capable  of  confining  the  light  energy  within  this  region 
while  guiding  it  along  the  longitudinal  direction.  In  the  case  of 
waveguide  fabrication  in  glass  by  ion  exchange,  the  refractive  index  is 
increased  by  exchanging  the  monovalent  network  modifiers  in  the 
glass  e.g.,  Na+  in  soda-lime  or  BK7  glass,  with  larger  polarizability 
cations  in  the  molten  salt,  e.g.,  K+,  Ag+  or  Tl+.  The  compositions  of  a 
typical  soda-lime  [Rama88d]  and  BK7  glasses  [Gort86b]  in  weight 
percentages  are  given  in  Table  1.  The  densities  of  the  soda-lime  and 
BK7  glasses  are  2.50  g/cm  and  2.51  g/cm  , respectively. 

Several  monovalent  cation  pairs  are  available  for  the  exchange 
e.g.,  Ag+-Na+,  K+-Na+,  Cs+-K+  and  Tl+-K+  etc.  Among  these  pairs,  Ag+- 
Na+  and  Tl+-K+  are  attractive  since  they  yield  large  index  changes 
(~0.1)  [Find85,  Rama88b].  The  index  change  can  be  lowered  by 
diluting  the  melt  with  the  exchanged  ions.  Thus,  with  the  above  two 
cation  pairs,  the  maximum  index  change  in  the  waveguide  is  adjustable 
in  a wider  range.  However,  it  is  well  known  that  thallium  ions  are  toxic 
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Table  1 


Glass  composition  in  weight  percentages 


Oxide 

Soda-lime 

BK7 

Si02 

72.25 

69.6 

Na20 

14.31 

8.4 

k2o 

1.20 

8.4 

CaO 

6.40 

- 

MgO 

4.30 

- 

B2O3 

- 

9.9 

BaO 

- 

2.5 

AI2O3 

1.20 

- 

SO3 

0.30 

- 

Fe203 

0.03 

- 

Others 

- 

1.2 
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and,  therefore,  great  care  must  be  exercised  in  the  fabrication.  Silver 
ions  are  easily  reduced  into  metallic  form  in  glass  via  reaction  with  the 
additives  in  the  glass  refining  process,  resulting  in  higher  losses. 
Nevertheless,  the  problem  of  silver-ion  reduction  is  eliminated  when 
high-quality  optical  glasses  and/or  lower  silver  concentration  (smaller 
index  change)  are  used. 

The  first  reported  low-loss  ion-exchanged  glass  waveguides  were 
fabricated  by  Izawa  and  Nakagome  [Izaw72]  using  thallium  ions  in  a 
borosilicate  glass  under  applied  field.  They  also  reported  the  first 
buried  planar  waveguides  (maximum  index  change  occurring  not  at 
the  glass  surface  but  below  the  surface)  fabricated  by  using  a two-step 
process.  The  channel  waveguides  were  realized  in  70s  by  ion-exchange 
technique  [Gede75b,  Gall76,  Wils76],  To  date,  several  passive 
components  e.g.,  power  dividers  or  couplers  [Cull84,  Yip84,  Maha88b], 
multiplexers  [Walk83b,  Mcco87],  filters  [Walk83a,  Hond84,  Conn87, 
Naja88a],  polarizers  [Find83],  and  sensors  [Honk87a],  have  been 
reported  using  ion-exchanged  glass  waveguides.  Furthermore,  ion- 
exchanged  waveguides  have  been  fabricated  in  the  nonlinear 
semiconductor-doped  glasses  [Cull86],  which  increases  the  possibility 
of  integrating  optical  switching  devices  in  glass.  Recently,  thermo- 
optic phase  modulator  has  been  demonstrated  in  ion  exchanged  glass 
waveguides  [Jest90]  and  rare-earth  doped  glass  waveguide  lasers 
[Sanf90,  Aoki90]  and  SiGe:H  photodetectors  on  glass  waveguides 
[Deim90]  have  been  demonstrated  using  the  ion-exchange  technique. 
In  addition,  feasibility  of  “transplanting”  III-V  opto-electronic  devices 
on  glass  substrates  [Yabl86]  and  incorporation  of  highly  nonlinear 
polymer  films  on  glass  waveguides  [Schl90]  suggests  that  opportunities 
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to  integrate  active  devices  with  passive  components  on  glass 

substrates  may  soon  become  a reality. 

In  this  dissertation,  a general,  systematic,  theoretical  and 
experimental  study  of  the  ion-exchange  process  in  glass  is  presented. 
The  study  encompasses  characterization  of  the  graded-index 
waveguides  fabricated  using  the  Ag+-Na+  ion-exchange  process,  as  well 
as  the  application  of  these  waveguides  to  practical  devices  such  as 
tapered  waveguide  transitions  and  multiplexers/demultiplexers  (MUX/ 
DEMUX)  using  symmetric  directional  couplers.  The  ion-exchange 
process  as  well  as  graded-index  waveguides  and  the  devices  have  been 
modeled  and  excellent  agreement  has  been  obtained  with  the 
experimental  results. 

In  chapter  2,  the  binary  ion  exchange  and  cation  diffusion  in 
glass  are  discussed  along  with  the  ion-exchange  equilibrium,  space- 
charge  effect,  multialkali  effect  and  temperature  dependence  of  the 
ion  exchange  process.  The  nonlinear  diffusion  equation  for  the 
incoming  cations  is  derived  from  the  Fick’s  law  with  the 
approximation  of  small  space-charge  density.  It  is  shown  that  the 
equation  can  be  linearized  if  the  concentration  of  the  incoming  ions  is 
low  or  the  mobilities  of  the  two  species  of  exchanging  ions  are  equal. 
Under  these  conditions,  analytical  solutions  are  available  for  one- 
dimensional (1-D)  cases  if  the  diffusion  coefficient  and  mobility  are 
constant.  However,  in  general,  a closed-form  solution  is  not  available. 

Since  the  closed  form  solution  of  the  nonlinear  diffusion 
equation  is  not  always  attainable,  chapter  3 deals  with  several 
numerical  methods  to  solve  the  equation  and  predict  the  resulting 
waveguide  propagation  characteristics.  Numerical  solutions  give  very 
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accurate  results  since  high  speed  computers  for  numerical  intensive 
calculations  and  library  programs  for  solving  difficult  problems  are 
readily  available,  evoking  the  numerical  analysis  to  play  a major  role  in 
the  modeling  of  the  complicated,  practical  cases.  Three  numerical 
techniques  including  Green’s  function  approach,  implicit  finite 
difference  method  (FDM)  and  explicit  FDM  will  be  presented  for 
solving  the  diffusion  equation.  The  concentration  profile  is  assumed  to 
be  proportional  to  the  profile  of  the  index  change.  Next,  the  index 
profile  is  used  in  the  scalar  Helmholtz  equation  to  solve  for  the  mode- 
field  profiles  as  well  as  the  corresponding  propagation  constants.  The 
WKB  method,  the  matrix  method  and  FDM  for  the  1-D  cases  are 
introduced  along  with  the  effective  index  method  and  FDM  for  the  2-D 
case.  The  beam  propagation  method  (BPM)  is  also  discussed  in  this 
chapter.  The  propagation  of  the  wave  in  various  waveguide  structures, 
with  index  varying  in  the  longitudinal  direction,  can  be  solved  by  this 
method. 

In  chapter  4,  1-D  (planar)  and  2-D  (channel)  waveguide 
fabrication  technique  is  described  in  detail.  In  the  fabrication  process 
for  practical  applications,  single-mode  waveguide,  supporting  only  one 
mode,  is  emphasized  since  most  applications  entail  this  requirement. 
One-dimensional  multimode  waveguides  are  fabricated  only  for  the 
purpose  of  determining  diffusion  characteristics.  The  fabrication 
procedure  for  1-D  waveguides  includes  melt  preparation,  sample 
preparation,  prebaking,  diffusion,  cleaning  and  polishing.  Similar 
procedures  are  applied  to  fabricate  2-D  waveguides  except  that  extra 
steps,  namely,  mask  preparation,  photolithography,  etching,  etc.,  are 
needed  before  prebaking  in  the  2-D  case.  Procedures  to  measure  the 
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mode  indices  of  planar  waveguides  and  the  mode  field  of  channel 
waveguides  are  also  illustrated  in  chapter  4.  The  index  profiles  are 
reconstructed  from  the  measured  mode  indices  by  the  inverse  WKB 
method  where  from  the  diffusion  coefficient  is  determined.  The 
knowledge  of  the  diffusion  coefficient  and  the  corresponding  diffusion 
parameters  is  very  helpful  in  the  modeling  of  the  waveguide  devices. 
The  mode  fields  and  losses  of  the  channel  waveguides  are  measured  by 
the  near-field  measurement  setup,  also  described  in  this  chapter. 

A systematic  modeling  of  the  tapered  waveguides  is  presented  in 
chapter  5.  As  shown  in  Fig.  1-1,  a tapered  waveguide  has  small 
dimension  at  one  end  and  large  dimension  at  the  other  end  so  that  it 
may  provide  a smooth  transition  between  two  devices  with  different 
sizes.  Smooth  transitions  usually  have  lower  radiation  loss  than  abrupt 
junction  since  the  index  profiles  are  continuous  in  the  propagation 
direction.  As  the  density  of  devices  on  the  same  substrate  becomes 
higher,  the  device  dimension  becomes  smaller.  In  connecting  small- 
size  devices  with  fiber,  tapers  can  also  help  reduce  the  mismatched 
loss  at  the  interface.  Thus,  the  realization  and  analysis  of  the  tapered 
waveguides  are  important  steps  before  high-density  integrated  optics 
come  true.  Taper  modeling  requires  simulation  of  1-D  and  2-D 
waveguides.  One-dimensional  simulation  provides  the  necessary 
information,  such  as  the  diffusion  coefficient  as  a function  of 
temperature,  for  the  successive  modeling  of  the  2-D  waveguides  and 
tapers.  Since  the  tapered  guides  are  fabricated  by  a two-step  process, 
the  simulation  is  divided  into  two  steps.  The  first  step  is  the 
simulation  of  straight  channel  waveguides,  i.e.,  solving  the  2-D 
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Fig.  1-1.  Schematic  of  a tapered  waveguide  used  as  an 
optical  interconnect  between  two  devices. 
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diffusion  and  Helmholtz  equations  with  index  profile  invariant  in  the 
propagation  direction.  The  resulting  index  profile  from  the  first  step 
is  fed  into  the  second  step  simulation.  The  tapers  are  fabricated  with  a 
thermal  gradient  and.  therefore,  the  temperature-dependent  diffusion 
coefficient  is  considered  in  the  second  step  simulation.  Furthermore, 
the  radiation  loss  of  tapered  waveguides  with  different  taper  profiles  is 
simulated  by  BPM. 

Chapter  6 consists  of  the  modeling,  fabrication,  and 
characterization  of  the  directional  couplers  as  wavelength 
MUX/DEMUX.  Wavelength  MUX/DEMUX  are  necessary  devices  to 
utilize  the  advantage  of  the  high  bandwidth  of  optical  fibers.  Figures  1- 
2 (a)  and  (b)  illustrate  two  signals  combined  by  a MUX  and  splitting  by 
a DEMUX,  respectively.  Note  that  DEMUX  plays  a function  reverse  to 
that  of  MUX.  They  are  fundamental  building  blocks  toward  high- 
capacity  optical  communication  system.  Wavelength  MUX/DEMUX  can 
be  realized  by  a symmetric  directional  coupler  constructed  by  two 
identical  waveguides,  as  shown  in  Figs.  1-3  (a)  and  (b).  The  design 
considerations  of  MUX/DEMUX,  based  on  spectral  bandwidth, 
crosstalk  and  cross  power  ratio  and  their  dependence  on  the 
fabrication  errors,  are  presented  in  this  chapter.  The  characterization 
setup  for  measuring  the  spectral  response  is  discussed  and  the 
experimental  results  are  used  to  estimate  the  coupling  length  as  a 
function  of  wavelength  and  the  results  are  compared  to  the 
predictions  of  the  modeling. 

The  last  chapter  (chapter  7)  concludes  by  summarizing  the 
results,  while  illustrating  other  applications  of  the  modeling.  It 
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Fig.  1-2.  Block  diagram  of  the  dual-wavelength  MUX/DEMUX. 

(a)  Two  input  signals  of  wavelengths  X-i  and  X2  are  combined 
at  the  output,  (b)  Mixed  wavelengths  and  X2  are  split  at 
the  output. 
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Fig.  1-3.  Symmetric  direction  coupler  as  a wavelength  MUX/DEMUX. 

(a)  The  energy  of  the  signal  A.]  transfers  to  the  other  channel 
and  mixed  with  the  signal  X.2at  the  output,  (b)  Mixed  signals  Xi 
and  X2are  separated  at  the  output. 
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also  describes  the  future  research  work  utilizing  the  well-developed 
ion-exchange  technique  to  fabricate  other  practical  devices  useful  in 
optical  communication  systems. 


CHAPTER  2 

ION-EXCHANGE  IN  GLASS 
2.1.  Introduction 

The  study  of  the  ion-exchange  process  is  important  since  the 
concentration  profile  of  the  diffused  ions  in  the  ion-exchange  process 
determines  the  index- change  profile  in  the  glass  and  therefore,  the 
characteristics  of  the  waveguide.  As  a result,  a knowledge  of  the  theory 
of  ion  exchange  as  well  as  the  quantitative  analysis  of  the  profile  of  the 
exchanged  ions  is  essential  for  successful  design  and  development  of 
passive  integrated  optical  components. 

Extensive  research  on  ion  exchanged  waveguides  and  devices 
has  been  performed  over  the  past  two  decades  and  several  excellent 
review  papers  [Dore69,  Kara79,  Find85,  Rama88b,  Rama88c,  Ross89] 
are  readily  available.  This  chapter  presents  the  theory  of  ion-exchange 
kinetics  and  its  application  to  the  silver-sodium  cation  system. 
Portions  of  this  chapter  have  been  extracted  from  previously  published 
results  [Dore64,  Rama88b].  The  governing  equation  in  the  ion 
exchange  namely,  the  nonlinear  diffusion  equation,  is  derived  from 
first  principles  using  the  diffusion  of  the  ionic  flux  according  to  the 
Nernst-Planck  equation.  The  inter-diffusion  coefficient  resulting  in  the 
equilibrium  condition  of  the  diffusing  ions  — one  ion  in,  one  ion  out  — 
will  also  be  discussed  in  this  chapter.  Moreover,  several  cases  will  be 
considered  so  that  analytical  results  for  the  concentration  profile  of 
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the  specific  diffusing  ion  can  be  obtained.  The  numerical  methods  to 
solve  the  diffusion  equation  and  the  associated  results  will  be 
described  in  the  next  chapter. 

2.2.  Kinetics  of  Ion  Exchange 

2.2.1.  Basic  Equations  for  Binary  Exchange 

The  basic  theory  of  ion  exchange  has  been  discussed  and  solved 
numerically  by  Helfferich  et  al.  [Helf58,  Ples58,  Helf62]  in  spherical 
coordinates;  it  has  been  applied  to  glasses  and  compared  with  the 
experimental  results  by  Doremus  [Dore64].  In  this  chapter,  only  two 
ions  (species  A and  B)  are  considered  in  the  formulation.  We  assume 
that  the  fluxes  of  exchanging  ions  A and  B are  coupled  solely  by  the 
electrochemical  potential  and  no  chemical  reaction  and/or  bonding  is 
assumed  during  ion  exchange.  The  ionic  fluxes  of  A and  B in  glass  are 
given  [Helf58]  by  the  Nemst-Planck  equation: 

S=H,z1C1e-D1VC,|J22i  , (2.U 

o lnCi 

— * _o  _ i 

where  i represents  ion  A or  ion  B,  is  the  ionic  flux  {m  sec  },  m is 
2 1 1 

the  mobility  {m  V sec  },  z\  is  the  electrochemical  valence,  Ci  is  the 

O — » 

concentration  {m  } of  either  exchanging  ion  in  glass,  £ is  the  local 
electric  field  (V/m}  consisting  of  the  space-charge  induced  field  £s 
and  the  applied  field  £a,  Di  is  the  self- diffusion  coefficient  {m  sec'  }, 
and  ai  is  the  thermodynamic  activity.  The  units  are  shown  in  the 
brackets.  The  first  term  represents  the  ions  driven  under  the  electric 
potential  difference  or  the  drift  term,  while  the  second  term  is  the 
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diffusion  term  and  represents  the  ions  diffused  under  the  chemical 
potential  difference  resulting  from  the  concentration  gradient. 

In  the  above  equation,  we  have  neglected  the  deformation  or  the 
strain  induced  by  the  swelling  or  shrinkage  of  the  glass  network  due 
to  the  exchange  with  larger  or  smaller  ions  in  the  melt  [Sane 78, 
Bran86].  If  the  deformation  is  considered,  one  more  term  proportional 
to  vd  representing  the  velocity  of  the  ions  moving  under  the  strain 
must  be  added,  vq  is  proportional  to  the  time  derivative  of 
r^CA+r|(CB-Co)  in  the  direction  normal  to  the  glass  surface,  where  the 
ri  is  the  radius  of  the  ion  i and  Co  is  the  concentration  of  the  ion  B 
before  ion  exchange.  In  the  case  of  K+-Na+  exchange  system,  this  term 
cannot  be  omitted  since  the  swelling  produced  by  the  larger 
potassium  ions  is  significant  [Tsut86].  Researchers  often  include  the 
strain  effects  in  the  diffusion  term  by  defining  a polarization- 
dependent  and/or  anisotropic  self-diffusion  coefficient  as  is  the  case 
with  potassium  exchanged  glass  waveguides  [Yip85].  This  approach  is 
unrealistic  since  the  strain  induced  term  is  not  proportional  to  the 
gradient  of  the  ionic  concentration.  However,  the  index  change  is 
polarization-dependent  due  to  the  anisotropic  stress  [Tsut88]. 

The  activity  can  be  written  as  ai  = Cim  where  m is  a constant 

representative  of  the  ions  in  silicate  glasses  [Lait57,  Karr62]1.  Thus, 
3lnai/3lnCi  = m and  Eq.  (2.1)  becomes 

<P=  PiQ  £-  m Di  VCi-  (2.2) 


1 


m is  denoted  as  n in  the  references. 
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Only  monovalent  ions  are  considered  in  Eq.  (2.1)  i.e.,  zi=l.  Note  that 
the  flux  ft  in  Eq.  (2.2)  is  a function  of  time  and  position,  so  are  Ci  and 
£.  Di  and  pi  are  temperature-dependent  [Dore64]  and  are  assumed  not 
to  be  functions  of  concentration  Ci  and  therefore  position  and  time 
during  the  diffusion;  however,  this  oversimplification  is  not  justifiable 
in  the  case  of  strong  multi-alkali  effect  [Isar69,  Day76,  Char82b]  which 
will  be  addressed  in  section  2.3.2. 

The  total  charge  in  the  system  must  remain  constant  (Fig.  2-1). 
The  relation  between  the  incoming  ion  concentration  Ca  and  the  out- 
diffusing  ion  concentration  Cb  in  glass  is  obtained  as  follows: 

Ca  + Cb  + Cs  = Co  . (2.3) 

where  Co  equals  the  concentration  of  the  ion  B in  glass  before 
diffusion;  in  other  words,  prior  to  diffusion,  as  shown  in  Fig.  2-1  (a),  Co 
= Cbo.  the  value  of  Cb  at  time  t=0.  Cs  is  the  concentration  of  the  net 
depleted  mobile  ions  at  a given  time  giving  rise  to  space  charge.  From 
Fig.  2- 1(b),  it  follows  that  Cs  = Co  - Ca  - Cb-  Cs  is  zero  only  when  the 
exchange  is  strictly  one-to-one  at  all  times.  It  exists  whenever  two 
dissimilar  species  of  unequal  mobilities  are  involved  in  the  exchange. 
For  example,  when  Na+  in  glass  and  Ag+  in  the  melt  exchange  with 
each  other,  Na+  ions  diffuse  out  of  the  glass  at  a faster  rate  which 
causes  negative  space  charge  to  build  up.  This  induces  a local  field 
which  slows  down  the  depletion  of  Na+  and  accelerates  the  in- 
diffusion  of  Ag+  until  such  time  the  equilibrium  condition  is  reached 
and  one-to-one  exchange  takes  place.  Besides,  the  application  of  an 
external  field  could  also  be  responsible  for  the  creation  of  space 
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Fig.  2-1.  Correlation  of  the  ionic  concentrations  in  ion  exchange  process  for 
Ha<  pB  and  DA  < DB . (a)  At  t=0,  Co=Cb0;  (b)  At  t>0,  Co=  CA+C^+C^. 

The  numbers  listed  illustrate  the  relation  of  the  ionic  concentration 
to  the  space-charge  density. 
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charge,  especially  when  the  mobilities  of  the  exchanging  ions  are 
unequal.  The  space  charge  is  related  to  the  electric  field  by  Poisson’s 
equation  namely, 

V . (er  e)  = ^LCs.  (2.4) 

where  er  is  the  low  frequency  relative  permitivity  (dielectric  constant) 
of  the  isotropic  substrate,  q is  the  monovalent  cation  charge  (equal,  in 
magnitude,  to  the  electronic  charge)  and  e0  is  the  permitivity  in 
vacuum.  er  in  Eq.  (2.4)  does  not  represent  the  dielectric  constant  at 
optical  frequencies,  which  determines  the  refractive  index  of  medium 
and  depends  on  the  glass  composition.  The  negative  sign  on  the  right 
hand  side  of  Eq.  (2.4)  indicates  that  the  total  charge  is  negative  in  the 
region  depleted  of  positive  ions;  in  this  case  Cs  > 0.  When  the  ions 
diffuse  into  the  glass,  the  composition  of  glass  as  well  as  its  dielectric 
constant  changes.  Therefore,  er  is  a function  of  Ca  and  can  be 
empirically  expressed  as  [Volf88] 


£r  - g Ca  + £ro  * 


(2.5) 


where  g is  an  empirical  coefficient  assuming  gCA«ero  and  ero  is  the 
dielectric  constant  before  diffusion. 

Since  the  numbers  of  ion  A and  ion  B are  conserved,  the 
continuity  equation 


V • <t>i  = 


aCi 

at  * 


(2.6) 
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is  applicable.  Equations  (2. 2) -(2. 6)  are  the  fundamental  equations 
governing  the  ion  exchange  process  between  two  monovalent  ions  in 
glass: 


Although  the  unknowns  Ca.  Cb,  and  Cs  are  functions  of  position 
and  time,  we  are  interested  in  finding  the  solution  for  Ca-  There  are 
no  simple  analytical  solutions  for  the  Eqs.  (2.2)-(2.6).  However,  with 
some  assumptions,  these  equations  can  be  rewritten  and  combined 
into  one  differential  equation  with  only  one  unknown,  Ca,  which  is 
assumed  to  be  proportional  to  the  refractive  index  change  in  the  glass. 

2.2.2.  Diffusion  Equation:  Small  Space-Charge  Density 
Approximation 

Since  only  network  modifiers  in  glass  are  involved  in  the 
exchange,  the  total  concentration  Co  of  the  mobile  ions  is  a constant. 
By  substituting  Eq.  (2.3)  into  Eq.  (2.6),  we  obtain 

V • ( (j)A  + (jJs  ) = - ^(Ca+Cb)  = - ^(Co-Cs)  = . (2.7) 

The  above  equation  describes  the  continuity  of  the  space  charge.  In 
the  case  of  one-to-one  exchange,  the  space  charge  density 
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corresponding  to  the  local  induced  field  is  usually  eight  orders  of 
magnitude  smaller  than  the  mobile  charge  density  [Walk83c];  Cs  can 
be  assumed  to  be  quite  small  compared  to  Ca  and  Cb,  so  that 

V-(C  + fi)sO.  (2.8) 


Substituting  Eq.  (2.2)  into  Eq.  (2.8)  and  assuming  constant  self- 
diffusion coefficients  and  mobilities,  after  some  algebraic 
manipulations,  the  divergence  of  the  field  £ is  expressed  as 


_ m (DaV2Ca+DbV2Cb)  - £/(PaVCa+PbVCb) 

V • <E  = 

PaCa+PbCb 


(2.9) 


Since  the  change  in  the  dielectric  constant  is  negligible,  we  can 
conclude  by  comparing  Eqs.  (2.4)  and  (2.9)  that  V • £ is  nonzero  with 
the  result  that  the  space  charge  Cs  is  non-zero.  The  rate  of  change  of 
the  concentration  of  ion  A can  be  derived  from  Eq.  (2.6)  by 
substituting  into  Eqs.  (2.2)  and  (2.9): 


= mDAV2CA  - pA £•  VCA  - PaCaV-  £ 
d t 


= mI)AV2CA 
- pA  £•  V C 


(Db/Da)PaCa  + PbCb 
PaCa  + PbCb 
Pb  (Ca+Cb) 

A PaCa  + PbCb 


(2.10) 


The  normalized  concentrations  (or  the  mole  fractions)  Na  and  Nb  are 
defined  as  Ca/(Ca+Cb)  and  Cb/(Ca+Cb),  respectively.  Equation  (2.10) 
can  then  be  rewritten  in  terms  of  the  inter-diffusion  and  inter- 
mobility coefficients  as 
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3Ca  _ ~ (Db/Da)maNa  + MbNb 

d t a a jiaNa  + MbNb 

- £•  VCA 

MaNa  + MbNb 

= D V2Ca  - 5 E'VCa  . 

(2.11) 

or 

®A  = DV2NA-ji£.VNA, 
0 t 

(2.12) 

where  D is  the  inter-diffusion  coefficient  defined  by  Doremus  [Dore64, 
Dore69]. 


D = mDA  ^aNa  + MbNb 

MaNa  + M-bNb 


(2.13) 


Similarly,  we  can  also  define  inter-mobility  coefficient  p as 


H _ M-a  Mb  _ Ma  4a 

^ HANa  + HbNb  i _ ( i _ — )Na  !-“na'  ,2-14) 

Mb 

kT 

a equals  1-Ma/Mb-  By  applying  the  Einstein  relation  DA  = — pA,  where 

k is  the  Boltzmann  constant  and  T is  the  absolute  temperature,  we  see 
that  Da/(ta=Db/mb  [Dore64].  D,  in  this  case,  has  the  same  form  as  p, 
except  for  the  factor  m.  Equation  (2.13)  then  becomes 

g mPAPB  mPA  mPA 

DaNa  + PbNb  ^_Q_^Aj]\jA  1 - ocNa  • (2.15) 


However,  it  is  well  known  that  the  Einstein  relation  often  does  not 
hold  in  glasses  due  to  several  mechanisms  [Beie85]  and  therefore. 


kT 


requires  modification.  The  modified  Einstein  relation  DA  = f — pA 


includes  a correction  factor  f which  is  usually  less  than  one  [Dore64, 


t 
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Dore69,  Rama88b].  Inspection  of  Eq.  (2.15)  reveals  that  D and  |i  are 
concentration  dependent  for  <xNa*0.  Hence,  Eq.  (2.12)  is  a nonlinear 
diffusion  equation  and  no  closed  form  solution  for  Na  is  possible. 
However,  explicit  analytical  solutions  can  be  obtained  under  certain 
conditions  which  are  described  below. 


2.2.3.  Diffusion  without  Applied  Field 


Under  equilibrium,  the  sum  of  the  two  ionic  fluxes  is  assumed  to 
be  always  zero,  resulting  in  one-to-one  exchange  between  the 
monovalent  ions: 


(2.16) 


Thus,  in  the  case  of  zero  external  field  (<Ea=  0),  the  local  field  £s  due 
to  space  charge  is  given  by 


<ES= Da-Db VNA  = (Da).~ 

M-aNa+P-bNb  IM-a;  1-  ocNa  ' 


(2.17) 


If  a is  zero,  i.e.,  in  the  case  of  equal -mobility  diffusion,  the  field  is  zero. 
Note  that  the  field  increases  as  the  gradient  of  NA  increases, 
indicating  that  the  space-charge-induced  field  is  stronger  at  the 
diffusion  front.  Equation  (2.12)  can  be  simplified  by  substituting  the 
field  in  Eq.  (2.17)  as 

^=V.(DVNa).  (2.18) 


Further  simplifications  of  the  3-D  nonlinear  equation  — Eq.  (2.18)  — is 
possible  in  the  following  cases. 
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a)  (a=0).  In  this  case,  Eq.  (2.18)  becomes  a linear 

equation: 


^ = V • (D  VNa)  = mDAV2NA . 


(2.19) 


b)  NA«1.  The  concentration  of  ion  A in  glass  is  very  small  so 
that  the  denominator  of  Eq.  (2.15)  equals  one.  In  this  case, 
the  equation  is  the  same  as  Eq.  (2.19). 

c)  Da«Db  (ha«Hb  or  a=l).  In  this  case,  the  inter-diffusion 
coefficient  can  be  approximated  as 

n~mPA 

1-Na- 

Equation  (2.18)  becomes 

isr = rit  [v2Na + rk  (VNa  • VNa)]  • <2.20, 

Equation  (2.20)  can  be  reduced  to  Eq.  (2.19)  in  the  special 
case  Na«1  and  |vNaI2«V2Na- 

d)  Two-dimensional  (2-D)  case.  In  the  2-D  case,  Eq.  (2.18)  can 
be  reduced  to 

^Na  _ d . mPA  dNA^  d , mPA  dNAl 
Pt  dx  l-aNA  3x  3y  l-aNA  dy  ' 

The  2-D  modeling  by  the  finite  difference  method  was  first 
presented  by  Wilkinson  and  Walker  [Wilk78].  We  will  discuss 
the  solution  for  the  above  case  in  the  next  chapter. 

e)  One-dimensional  (1-D)  case.  The  further  reduction  of  Eq. 
(2.21)  to  one-dimension  yields 
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dNA  _ d . mPA  dNA. 
3t  dx  1-ccNa  dx 


If  aNA«l,  Eq.  (2.22)  becomes 


9Na 

at 


mDA 


92Na 

dx2 


(2.22) 


(2.23) 


Thus,  the  above  1-D  diffusion  is  valid  in  the  case  of  ion 
exchange  without  applied  field  for  1)  equal  mobility  ions  i.e„ 
no  space  charge;  2)  unequal  mobilities  but  low 
concentration  of  diffusants,  i.e.,  small  space  charge  effect. 

Analytical  solutions  are  available  for  Eq.  (2.23)  in  the  following 
two  cases  of  ion  exchange  in  glass  provided  DA  is  both  position  and 
time  independent.  As  described  earlier,  this  is  valid  as  long  as  we  do 
not  consider  the  multi-alkali  effect. 

a)  Infinite  diffusion  source.  Concentration  NA  at  the  glass 
surface  (x=0)  is  kept  constant  during  diffusion.  Therefore, 
the  initial  and  boundary  conditions  are 

NA(x,t=0)  = 0 , for  x > 0 , (2.24a) 


and  at  the  surface, 

NA(x=0,t)  = Nao  , for  t > 0 . (2.24b) 

At  the  other  boundary  for  infinite  glass  thickness, 

NA(°°,t)=0  , for  t > 0 . (2.24c) 

The  solution  [Dore64,  Cran75]  of  Eq.(2.23)  is 
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NA(x,t)  = NAo  erfc(-^-) , (2.25) 

Weff 

where  erfc(u)  is  the  complementary  error  function  defined 
as 

erfc(u)  = l'-^nj^expl-t^dt  . 

\ 7t 

x 

and  is  shown  in  Fig.  2-2,  where  u = ^7—  is  the  normalized 

w eff 

distance.  At  a given  time  t,  the  effective  width  of  the  profile 
at  u=l,  is 


Weff  = 2YmDAt  , (2.26) 

corresponding  to  the  distance  from  the  surface  (x=0)  to  the 
position  where  NA/ NAo=erfc(l)=0. 1573.  Note  that  the 
effective  width  Weff  of  the  waveguide  is  proportional  to  the 
square  root  of  the  diffusion  time  and  the  self-diffusion 
coefficient.  The  1/e  width  of  this  profile  is  defined  as 
[Rama86]  Wi/e=0.6367Weff.  Figure  2-2  also  illustrates  the 
normalized  plots  of  both  Gaussian  and  exponential  profiles 
along  with  the  normalized  erfc(0.6367  u)  curve  for 
comparison  purposes.  All  of  the  solid-line  profiles  are 
normalized  so  that  the  effective  width  now  corresponds  to 
NA(u=l)=NAo/e.  The  area  under  the  normalized,  dashed 
(complemetary  error  function)  curve  NA(u)  is  proportional 
to  the  total  number  of  ions  diffusing  into  the  glass  and  is 
given  by 


Fig.  2-2 
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Fig.  2-2.  Comparison  between  the  normalized  erfc  function  at  1/e  point,  Gaussian  and 
exponential  functions.  The  dashed  curve  is  erfc(u). 
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oo 


NA(x,t)dx  = NaqW  eff 


2NA(^/mSt 


(2.27) 


Note  that  the  area  is  proportional  to  the  square  root  of  the 

dA.  i 

diffusion  time  implying  that  slows  down  with  increasing 

time.  It  is  clear  that  control  of  the  total  number  of  ions 
diffused  improves  with  longer  diffusion  time, 
b)  Finite  diffusion  source.  The  concentration  Na  of  the 
diffusing  ion  A at  the  surface  (x=0)  decreases  whenever 
these  ions  are  not  replenished  during  diffusion.  As  the 
diffusant  is  exhausted,  the  profile  became  closer  to  a 
Gaussian.  Due  to  the  difficulty  in  terminating  the  diffusion 
exactly  at  the  point  of  exhaustion  of  the  diffusion  and  the 
ambiguity  involved  in  the  profile,  the  problem  is  difficult  to 
analyze.  In  addition,  with  the  erfc  function  as  the  initial 
profile,  it  is  rather  difficult  to  solve  the  problem  analytically. 
However,  it  becomes  easier  if  we  consider  a delta  function 
source  as  the  initial  condition  for  the  diffusion  process.  This 
can  be  illustrated  by  considering  a two-step  process.  To 
achieve  a delta  function  profile  in  the  first  step,  an 
extremely  small  amount  of  diffusion  time  is  required.  In  the 
second  step,  annealing  these  samples  permits  the  diffusion 
of  the  ions  present  just  under  the  surface,  further  into  the 
substrate.  However,  the  area  under  the  curve  A2  for  several 
diffusion  conditions  (Fig.  2-3)  and  hence,  also,  the  total 
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Fig.  2-3.  The  concentration  profiles  of  finite  diffusion  source  for 
W = 1.0,  1.5,  and  2.0. 
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number  of  ions  A in  the  glass  remain  constant.  The 
conditions  for  the  second  step,  which  we  term  as 
postbaking,  are 

NA(x,t=0)  = 0 , for  x > 0 , 

, (2.28) 
and  J^°  NA(x,t)dx  = A2  , for  t > 0 . 

Since  the  initial  (first  step)  concentration  profile  is  a delta 
function,  the  solution  is  a gaussian  [Cran75],  given  by 

NA<x’t)  = t^texp[_(w)2]  ■ (2-29) 

where  W=Wefp=2V mDAt  is  now  the  depth  at  the  1/e  point. 
The  maximum  concentration  position  always  remains  at  x=0 
and  the  maximum  value  decreases  with  increasing  diffusion 
time  and  is  inversely  proportional  to  W,  the  1 /e  width  of  the 
profile.  Further  diffusion,  as  shown  in  Fig.  2-3,  does  not 
alter  the  gaussian  nature  of  the  profile.  The  time  t in  this 
case  represents  the  postbaking  time. 

2.2.4.  Field  Assisted  Diffusion 

An  external  field  can  assist  the  diffusion  and,  therefore,  either 
the  diffusion  time  or  the  diffusion  temperature  can  be  reduced  to 
fabricate  waveguides  [Char78].  Furthermore,  the  field  can  drive  the 
dopants  deeper  into  the  glass  resulting  in  a buried  waveguide,  which  is 
immune  to  surface  scattering,  in  a two-step  diffusion  process,  as 
described  in  chapter  3.  Buried  waveguides  exhibit  lower  losses 
[Chen88]. 
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Since  an  ionic  current  flows  when  an  external  field  is  applied, 
Eq.  (2.16)  is  not  valid  in  the  case  of  an  applied  field.  According  to  Eq. 
(2.2),  the  total  flux  of  both  ions  normalized  to  the  total  mobile  charges 
is  expressed  as 

= Pb(1-ccNa):e  + mDBaVNA,  (2.30) 


where  we  assume  that  the  value  of  m is  the  same  for  the  two  cations. 
Using  Eq.  (2.17)  and  (2.30),  the  total  field  can  be  written  as 


E=  5+  S = -Da«VNA  <t>A+<i>R 

Ma  1-ccNa  Pb(1-ocNa)C0  * 


(2.31) 


■Ea  is  related  to  the  current  density  J as 


£a  = 


(2.32) 


where  f is  Faraday’s  constant  (=96485  Coul/mole)  and  nq  is  the 
Avogadro’s  number  (=6.022X1 023  mole'1).  In  the  case  of  no  applied 
field  (2^=0),  the  current  density  J is  zero.  The  space  charge  Cs  is 
generally  nonzero  in  the  case  of  field-assisted  diffusion  if  a is  not  zero. 
However,  if  NA  is  much  less  than  unity  or  the  mobilities  of  ions  A and 
B are  equal,  then  Eq.  (2.12)  is  still  valid.  The  equation  is  also 
applicable  in  the  case  of  small  applied  fields  with  a negligible  space 
charge  density.  In  these  cases  Eq.  (2.12)  can  be  rewritten  as 

= mDA  V2Na  - pA  e-VNa  , 

a t 


(2.33) 
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where  the  field  is  approximated  by  the  applied  field  £a.  For  one- 
dimensional diffusion,  Eq,  (2.33)  is  reduced  to 


aNA  = mDAa2NA  --^na 


at 


ax2 


dx 


(2.34) 


For  the  conditions  given  by  Eq.  (2.24),  an  analytical  solution  is 
obtained  [Malk61,  Cuch61,  Lili82]  as 

Na  = ^^{erfc(u-u’)  + exp(4uu)  erfc(u+u’)j , (2.35) 

Z 

x Pa  £t 

where  u = u’  = Weff  , and  Weff  is  given  by  Eq.  (2.26).  Equation 

(2.35)  is  plotted  in  Fig.  2-4  as  a function  of  u for  various  values  of  u’. 
u’=0,  corresponds  to  the  case  with  no  applied  field  and  the  profile  is 
an  erfc  function.  For  u’  > 3 i.e.,  in  the  case  of  large  fields,  the  second 
term  on  the  right  hand  side  of  the  above  equation  is  negligible 
[Rama86].  Thus, 

Na  = erfc(u-u’) . (2.36) 

The  width  of  the  waveguide  at  the  half-maximum  approximately  equals 
pA£t  for  u’  > 3.  This  depth  of  diffusion  at  the  half  maximum  due  to  the 
diffusion  of  the  ions  into  the  substrate  under  applied  field  is  clearly 
different  from  that  in  the  case  with  no  applied  field,  where  the  depth 
Weff  is  proportioal  to  the  square  root  of  the  diffusion  time.  The  area 
under  the  steplike  curve  for  u’  > 3 approximately  equals  u’Nao  and 
corresponds  to  a value  of  pA£tNAo  ions. 
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Fig.  2-4.  The  concentration  profiles  with  and  without  the  applied  filed  for 

u’=0.0,  1.0,  2.0,  3.0,  and  4.0.  The  solid  line  (u’=0.0)  is  for  Ea=0;  the 
dashed  lines  are  for  Ea*0  and  the  dotted  lines  are  the  approximation. 
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2.3.  Silver-Sodium  Ion  Exchange 

There  are  many  ways  to  fabricate  glass  waveguides.  The 
requirements  of  fabrication  techniques  or  material  systems  to  obtain 
high  performance  waveguide  has  been  reviewed  in  detail  in  Ref. 
[Pitt81,  Pitt82].  As  discussed  earlier,  ion  exchanges  with  different 
exchanging  ion  pairs  have  been  used  to  fabricate  optical  waveguides  in 
glass;  for  example,  Ag+-Na+  [Gial73,  Gede75a,  Sava76,  Gall76,  Stew77, 
Walk83c,  Lagu84b],  K+-Na+  [Gial73,  Yip84,  Gort86a,  Naja88b,  Mili89], 
Tl+-K+  [Izaw72,  Land85],  Cs+-Na+  [Neum79],  Cs+-K+  [Ross86],  Li+-Na+ 
[Char80],  etc.  The  index  changes  for  the  above  cation  pairs  are  listed 
in  Ref.  [Char82a,  Find85,  Rama88b].  Among  them,  the  silver-sodium 
ion  exchange  is  one  of  the  most  attractive  ion  pairs  since  1)  it  is  much 
less  toxic  compared  to  Tl+,  2)  the  diffusion  can  be  carried  out  at 
relatively  low  temperatures  due  to  the  relatively  large  diffusion 
coefficient  of  the  silver  ions,  3)  a broad  range  of  index  changes  (up  to 
0.1)  is  achievable  in  soda-lime  glass  simply  by  changing  the 
concentration  of  the  silver  ions  in  the  glass  [Stew78,  Houd86],  and  4) 
the  ionic  size  of  Ag+  (1.26A)  is  close  to  that  of  Na+  (0.95A)  resulting  in 
negligible  birefringence  [Honk87b].  Single-mode  waveguides  (with 
smaller  index  change)  can  be  fabricated  by  lowering  the  concentration 
of  the  silver  ions  in  the  glass.  The  propagation  loss  in  the  silver- 
sodium  ion-exchanged  waveguides  is  primarily  caused  by  the 
absorption  of  the  reduced  silver  ions  in  the  glass  [Gial73,  Walk83c] 
and  partially  due  to  the  scattering  caused  by  the  surface  imperfections 
and  inhomogeneities  in  the  glass.  The  absorption  occurs  at  495  nm  in 
the  soda-lime  glass  [Faik83]  due  to  the  incorporation  of  the  impurities 
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introduced  in  the  glass  during  fabrication.  The  propagation  loss  can  be 
decreased  by  using  high  quality  optical  glass  as  substrates  with 
considerably  lower  impurity  concentrations.  Ag+-Na+  channel 
waveguides  with  an  attentuation  of  less  than  0.068  dB/cm  at  0.633  pm 
in  BK7  were  first  demonstrated  by  Eguchi  et  al.  [Eguc83].  However, 
the  loss  ( ~2  dB/cm  at  0.633  pm)  is  higher  in  pyrex  glass  [Imai83].  By 
carefully  choosing  of  the  glass  substrate  and  fabricating  the  waveguide, 
a propagation  loss  below  0.1  dB/cm  is  attainable  for  Ag+-Na+ 
exchanged  single-mode  channel  waveguides  [Rama88b]. 

The  Ag+-Na+  exchanged  glass  waveguides  can  be  fabricated  by 
immersing  the  substrate  into  a molten  salt  bath  or  by  the  deposition  of 
a metallic  silver  film  and  diffusion  under  an  external  electric  field 
[Vilj80,  Naja86,  Terv87,  Honk88,  Zeng88].  In  the  latter  technique  a 
silver  stripe  is  first  deposited  on  the  glass.  Diffusion  is  then  carried 
out  at  an  elevated  temperature  with  the  aid  of  an  electric  field.  The 
index  change  can  be  adjusted  by  varying  the  applied  field  [Naja86J. 
However,  the  propagation  loss  is  somewhat  higher  than  that  of 
waveguides  fabricated  by  the  ion  exchange  process  using  the  molten 
salt  bath.  The  ion  exchange  process  using  the  salt  bath  will  be 
described  in  detail  in  chapter  4. 

2-3.1.  The  Equilibrium  Condition  and  the  Index  Change 

When  a glass  substrate  is  immersed  into  a molten  salt  bath,  ion 
exchange  occurs  across  the  interface  when  no  external  field  is  applied. 
The  glass-melt  interactions  were  first  described  in  detail  by  Stern 
[Ster66],  this  was  followed  by  a theoretical  study  of  the  exchange  and  a 
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comparison  to  experimental  results  by  Garfinkel  [Garf68].  The 
exchange  process  reaches  an  equilibrium  at  the  glass-melt  interface 
and  can  be  described  as  follows 

-A-melt+Bgiass  Agiass+Bmelt  , (2.37) 


where  A and  B represent  silver  and  sodium  ions,  respectively.  The 
exchange  process  is  governed  by  the  regular  solution  theory  [Kirk61] 
in  the  melt  and  an  m-type  behavior  in  the  glass  [Karr62].  The 
relationship  between  the  concentration  Na  in  the  glass  and  Na  in  the 
melt  is  given  by  [Garf68] 


Na 

1-Na) 


- In  K, 


(2.38) 


where  H is  the  interaction  energy  of  the  two  cations  in  the  melt,  R is 
the  gas  constant,  T is  the  absolute  temperature,  and  K is  the 
equilibrium  constant,  given  by  [Karr62] 

v - <3-a  <1b 

aBa!’  (2-39) 

and  a.  s without  the  bars  represent  the  thermodynamic  activities  of  the 
ions  in  the  glass  and  those  with  bars  represent  the  activities  in  the 
melt.  Notice  that  a bar  over  the  letter  is  used  for  the  liquid  phase, 
contrary  to  the  notation  in  Refs.  [Garf68,  Chlu87,  Rama88d].  K > 1 
indicates  the  affinity  of  the  glass  for  ion  A i.e.,  the  glass  prefers  the 
intake  of  ion  A and  the  equilibrium  in  Eq.  (2.37)  tends  to  move  to  the 
right  hand  side  [Garf68].  In  this  case,  the  concentration  of  ion  A in  the 
glass  is  higher  than  that  in  the  melt  at  equilibrium  [Chlu87,  Rama88d]. 
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On  the  other  hand,  for  K < 1,  the  glass  prefers  to  retain  ion  B instead. 
The  equilibrium  constant  depends  on  the  diffusion  temperature  and 
the  ion  pair  involved  as  well  as  the  composition  of  the  host  glass.  The 
experimental  method  to  determine  K is  described  in  detail  elsewhere 
[Rama88d].  In  the  case  of  Ag+-Na+  in  soda-lime  silicate  glasses,  K is 
much  larger  than  unity,  usually  in  the  range  of  75  to  120  [Rama88d]. 
Therefore,  the  exchange  equilibrium  tends  to  shift  to  the  right  hand 
side  of  Eq.  (2.37)  resulting  in  a much  larger  normalized  concentration 
of  the  silver  ions  in  the  glass  than  in  the  melt.  For  example,  NAg  in 
soda-lime  glass  is  known  to  be  as  large  as  0.1  for  10'4  MF  of  the  silver 
ion  concentration  in  the  melt  [Chlu87,  Rama88d).  Since  the  silver  ion 
concentration  in  the  glass  is  proportional  to  the  index  change,  the 
control  of  the  silver  concentration  in  the  melt  is  of  paramount 
importance.  In  the  case  of  the  Ag+-Na+  exchange.  An  can  be  expressed 
empirically  as  An  = 0.1NAg  [Rama88d].  The  concentration  of  the 
silver  ions  in  the  melt  can  be  monitored  using  a concentration  cell 
technique  [Lagu84a]  using  an  electrolytic  technique  to  release  the 
silver  ions  through  a silver  electrode  [Lagu84b,  Naja85],  and  feeding 
back  the  measured  signal  to  control  the  electric  current,  thus, 
keeping  the  concentration  of  the  silver  ions  constant  in  the  melt.  The 
electrolytic  release  is  one  of  the  advantages  of  the  exchange  with  silver 
ions  since  high  purity  silver  is  available  in  electrode  form  and  is  a non- 
reactive metal.  The  index  change  can  be  controlled  to  an  accuracy  of 
10  4 by  this  technique.  This  accuracy,  as  shown  by  Millar  and  Hutchins 
[Mill78],  is  required  in  fabricating  passive  waveguide  devices  with  high 
performance. 
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The  refractive  index  of  glass  depends  on  the  glass  composition 
[Hugg40,  Hugg43,  Abou88,  Volf88].  Since  only  network  modifiers  in 
glass  are  involved  in  the  exchange  process,  the  glass  structure  is 
basically  unchanged.  Therefore,  the  index  change  of  the  glass  is 
determined  by  the  concentration  of  the  exchanging  ions  in  the  glass. 
In  the  case  of  Ag+-Na+  exchanged  waveguides  fabricated  using  low  melt 
concentrations,  the  index  change  is  primarily  due  to  the  difference  in 
the  electronic  polarizability  of  the  ions.  Silver  ions  have  a larger 
polarizability  (2.4  A3)  than  sodium  ions  (0.43  A3)  [Find85],  which 
results  in  an  increase  of  the  index  upon  exchange.  The  waveguide 
region  is  usually  a few  to  a few  tens  of  micrometers  deep  and  has  its 
maximum  index  at  the  surface.  However,  as  has  been  pointed  out 
earlier,  the  concentration  of  the  silver  ion  in  glass  is  not  a linear 
function  of  that  in  the  melt  [Garf68];  especially  at  very  low  melt 
concentrations  (10'4  MF)  [Chlu87].  For  single-mode  waveguides  with 
fiber-compatible  mode  sizes,  an  index  change  of  0.01  or  less  is 
required  which  corresponds  to  NAg  = 0.1  in  the  glass  or  N^g  = 2x1  O'4 

MF  in  the  melt  [Chen87,  Rama88a]. 

2.3.2.  Space  Charge  Effect  and  Multi-Alkali  Effect 
Space  Charge  Effect 

As  discussed  previously,  the  space  charge  is  usually  non-zero 
even  when  no  external  field  is  applied  during  the  exchange.  The  space 
charge  density  primarily  depends  on  the  mobility  ratio  between  the 
two  exchanging  ions,  no  space  charge  is  created  if  the  ratio  is  one.  In 
the  case  of  Ag+-Na+  exchange  in  BK7  glass,  the  mobility  of  the  sodium 
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ions  is  about  80  times  larger  than  that  of  the  silver  ions  [Chen90a]. 
Since  the  sodium  ions  have  higher  mobility,  under  the  chemical 
potential  difference  during  the  diffusion  process,  the  movement  of  the 
sodium  ions  from  the  glass  into  the  melt  is  easily  accomplished  leaving 
vacancies  with  net  negative  charges.  An  electric  potential,  therefore, 
develops  balancing  the  diffusion  process.  The  induced  field  affects  the 
movement  of  both  the  silver  and  sodium  ions.  The  electric  potential 
corresponding  to  the  induced  field  opposes  the  chemical  potential 
driving  the  Na+  ions  whereas  it  aids  that  of  the  Ag+  ions.  As  a result, 
the  out-diffusion  of  the  sodium  ions  is  slowed  down  and  the  movement 
of  the  silver  ions  into  the  glass  is  accelerated.  The  ion-exchange 
process  is  affected  by  both  the  diffusion  under  the  chemical  potential 
and  the  drift  by  the  induced  electric  field.  This  effect  is  reflected  in 
the  inter-diffusion  coefficient,  shown  in  Eq.  (2.15).  If  the  denominator 
of  the  right  hand  side  of  Eq.  (2.15)  approaches  zero  viz.,  ocNa  — » 1,  in 
other  words,  both  Da  « Db  and  Na  -»  1 must  be  satisfied.  In  this  case, 
the  inter-diffusion  coefficient  D is  much  larger  than  the  self-diffusion 
coefficient  DA.  This  effect  is  especially  profound  at  the  surface  of  the 
waveguide  since  the  concentration  of  silver  ions  is  maximum  at  the 
surface.  The  diffusion  is  faster  at  the  surface  than  it  is  deeper  into  the 
substrate.  Hence,  a large  concentration  gradient  is  formed  between 
the  region  with  aNA  -4  1 (at  surface)  and  aNA  « 1 (near  the  tail  of  the 
index  profile)  and  this  gradient  gradually  moves  into  the  substrate 
during  the  diffusion  process.  The  index  profile,  in  this  case,  is  no 
longer  a complementary  error  function  as  is  expected  for  the  infinite 
source  diffusion;  instead  it  may  become  a Gaussian-like  (ocNa<1) 
[Chen90a]  or  a step  function  ( e.g .,  ocNA  - 0.999  for  Cs+-Na+  in  soda- 
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lime  glass)  [Neum79]  because  of  the  larger  inter-diffusion  coefficient 
at  the  surface.  It  is  clear  that  as  ocNa  approaches  unity,  the  diffusion 
profile  is  essentially  similar  to  that  of  the  linear  cases  with  the  applied 
external  field  as  seen  later  in  the  numerical  simulation. 

Multi-Alkali  Effect 

The  inter-diffusion  coefficient  is  concentration-dependent  as 
shown  in  Eq.  (2.15)  where  a constant  self-diffusion  coefficient  was 
used  in  the  derivation  since  the  interactions  between  the  exchanging 
ions  in  glass  is  negligible.  However,  the  self-diffusion  coefficients  are 
often  functions  of  concentrations  if  the  two  exchanging  ions  interact 
during  the  diffusion  process.  The  inter-dependence  results  in  the 
correlation  of  the  diffusion  coefficients  of  the  two  exchanging  alkali 
ions  [Isar69,  Day76],  known  as  the  multi-alkali  effect.  Jain  et  al. 
[Jain83]  have  studied  this  effect  rigorously  in  the  case  of  sodium- 
cesium  silicate  glass.  Although  silver  does  not  belong  to  the  alkali 
group  (Group  LA),  it  is  known  to  affect  the  interaction  energy  of  the 
sodium  ions  in  the  glass,  as  reported  in  Ref.  [Neum79].  The  self- 
diffusion  coefficient  of  the  silver  ions  increases  as  the  concentration  of 
the  silver  ions  NAg  increases  as  demonstrated  in  the  Ag+-Na+ 
exchanged  soda-lime  glass  [Rama86].  However,  the  self-diffusion 
coeffient  of  the  sodium  ions  decreases  as  NAg  increases.  This  results  in 
a space-charge  behavior  and  causes  the  concentration  profile  of  the 
silver  ion  in  glass  to  be  closer  to  a Gaussian  function  even  at  low 
concentrations  e.g.,  0.5  MF  [Chen90a]. 
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2.3.3.  Temperature  Dependence  of  the  Self-Diffusion  Coefficient 

Both  the  self-diffusion  coefficient  and  the  mobility  of  the 
exchanging  ions  in  glass  depend  on  temperature.  It  is  well  known  that 
glass  exhibits  different  diffusion  characteristics  above  and  below  the 
glass  transition  temperature  [Ster66,  Dore69].  Kaps  and  Volksch 
[Kaps82]  experimentally  demonstrated  this  temperature  dependency 
in  the  case  of  Ag+-Na+  in  Na20*2Si02  glass;  they  also  measured  the 
sodium  self-diffusion  coefficient  by  isotope  substitution  in  the  same 
glass  [Kaps84],  However,  we  are  only  interested  in  diffusion  below  the 
glass  transition  temperature.  The  temperature  dependence  of  the  self- 
diffusion coefficient  below  the  glass  transition  temperature  follows  the 
well-known  Arrhenius  relation  [Dore64,  Ster66] 

Da  = DAo  exp(-  ^ti)  (2  40) 

where  AH  is  the  activiation  energy,  R is  the  gas  constant,  T is  the 
absolute  temperature,  and  DAo  is  a constant.  In  general,  both  DAo  and 
AH  depend  on  the  glass  composition  and  concentration  of  the  diffused 
ions  as  well  as  the  ion  pair  involved  in  the  exchange.  For  Ag+-Na+ 
exchange  in  BK7  glass,  DAo  and  AH  in  the  above  equation  are  found  to 
be  3.71x10  pm  /min  and  105.1  KJ/mole,  respectively,  with  a melt 
composition  of  10  MF  of  silver  ions,  0.33  MF  of  potassium  ions,  and 
0.67  MF  of  sodium  ions  [Chen90a],  where  m is  assumed  to  be  unity. 
The  activiation  energy  AH  is  98.7  KJ/mole  as  reported  by  Eguchi  et  al. 
[Eguc83].  The  technique  to  determine  DAo  and  AH  will  be  presented 
in  chapter  4. 
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2.4.  Summary 

We  have  theoretically  discussed  the  concentration  profiles  of  ion 
exchanged  waveguides  in  glass  with  or  without  an  externally  the 
applied  electric  field  in  glass.  The  nonlinear  diffusion  equation  is 
derived  from  fundamental  principles  such  as  Fick’s  law,  continuity 
equation  and  Gauss’s  laws.  That  equation  can  be  linearized  if  the 
product  ccNa  is  close  to  zero,  i.e.,  equal  mobilities  of  both  ions  and/or 
low  concentration  of  ion  A in  glass.  Analytical,  closed  form  solutions 
are  possible  for  the  one-dimensional  case  with  or  without  an  applied 
field  in  glass.  However,  in  the  two-dimensional  case,  the  equation 
must  be  solved  by  numerical  techniques,  as  will  be  presented  in  the 
next  chapter. 


CHAPTER  3 

NUMERICAL  MODELING  OF 
PLANAR  AND  CHANNEL  WAVEGUIDES 

3.1.  Introduction 

Planar  waveguides  confine  the  light  in  one  of  the  transverse 
directions,  namely  the  thickness  direction,  whereas  channel 
waveguides  confine  the  light  in  both  transverse  directions.  These 
waveguides,  channel  waveguides  in  paricular,  are  the  basic  building 
blocks  of  sophisticated  guided-wave,  integrated  optical  devices  for 
applications  in  optical  communication  and  signal  processing  systems. 
The  understanding  of  the  waveguiding  characteristics  of  both  the 
planar  and  channel  waveguides  is  essential  in  the  design  of  high 
performance  devices. 

Prior  to  solving  the  problem  of  wave  propagation  in  waveguides, 
we  must  first  solve  the  nonlinear  diffusion  equation  discussed  in  the 
previous  chapter  to  determine  the  index  profiles  of  both  the  1-D  and 
2-D  cases.  The  diffusion  profile  depends  upon  the  cation  pairs,  the 
characteristics  of  the  host  glass  and  the  fabrication  conditions; 
analytical  solutions  applicable  to  planar  waveguides  in  certain  cases 
were  presented  in  the  previous  chapter. 

The  first  part  of  this  chapter  deals  with  numerical  solutions  for 
the  diffusion  equation  where  closed  form  solutions  are  not  available. 
Nowadays,  exact  solutions  of  the  nonlinear  diffusion  equation  are 
possible  using  numerical  techniques  due  to  the  availability  of  high 
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speed  computers  at  low  cost.  The  modeling  allows  us  to  optimize  the 
process,  tailor  the  index  profile  and  attain  optimal  mode  size  to  match 
other  optical  devices.  Besides,  it  is  quite  helpful  in  the  design  of 
complex  devices.  In  the  later  part  of  this  chapter,  several  numerical 
methods  are  presented  for  solving  the  Helmholtz  wave  equation  for 
both  the  1-D  and  2-D  cases. 

Under  certain  conditions  outlined  in  the  last  chapter,  the 
nonlinear  equation  can  be  linearlized;  often,  depending  upon  the 
characteristics  of  the  cation  pairs  in  the  host  glass  and  their 
concentrations,  the  exchange  process  can  be  treated  as  a single 
species  diffusion.  Closed  form  solutions  are  not  always  available  even 
for  the  linear  case  depending  on  the  initial  and  boundary  conditions. 
The  resulting  diffusion  equation  Eq.  (2.19)  in  such  cases  as  well  as  the 
nonlinear  equation  Eq.  (2.18)  can  be  solved  numerically  by  using  a 
number  of  approaches.  One  such  appproach  that  is  quite  useful  is 
Green’s  function  method  illustrated  in  the  subsequent  section.  This 
method  is  especially  useful  in  the  numerical  solutions  of  1-D  surface 
and  buried  waveguides  as  well  as  2-D  surface  waveguides.  However,  in 
general,  the  Green’s  function  approach  is  not  applicable  to  the 
nonlinear  equation.  In  that  case,  the  well  known  finite  difference 
method  (FDM)  may  be  employed.  In  the  following  subsection,  both 
approaches,  viz.,  the  Green’s  function  and  FDM  are  discussed. 

The  concentration  profile  of  the  diffused  ions  is  assumed  to 
approximate  the  index  profile  of  the  waveguide.  Once  the  index  profile 
is  known,  the  fields  in  the  waveguide  can  be  found  by  solving  the 
Helmholtz  wave  equation.  The  Helmholtz  equation  can  be  solved 
numerically  by  a number  of  well  known  methods:  the  WKB  method. 
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the  matrix  method,  1-D  FDM,  a variational  approach  using  finite 
element  method  (FEM),  perturbation  techniques,  etc.  for  the  one- 
dimensional case  and  the  effective-index  method  (EIM)  [Rama74, 
Hock77,  Chia86,  Veld88],  2-D  FDM  [Ster88a,  Ster88b],  again 
variational  approach  [Mats73,  Mish86,  Haus87b]  with  FEM  [Yeh79, 
Maba81,  Kosh85a,  Kosh85b,  Lagu86b],  perturbation  technique 
[Kuma83],  spectral  index  method  [Mcil89,  Mcil90,  Ster90],  etc.  for 
the  two-dimensional  case.  A comprehensive  review  of  some  of  the 
above  techniques  is  presented  by  Kogelnik  [Koge88].  In  this  chapter 
numerical  approaches  applicable  to  one-dimensional  waveguides  by 
using  WKB,  matrix,  and  FDM  methods  are  illustrated.  This  is  followed 
by  a discussion  of  EIM  and  FDM  techniques,  as  applied  to  2-D 
waveguides. 

In  all  of  the  above  techniques,  we  assume  a z-invariant  [i.e., 
invariant  in  the  direction  of  propagation)  index  profile.  Some 
integrated  optical  devices  involve  tapered  waveguides,  cross-couplers, 
Y-junctions,  etc.  In  these  cases,  the  index  profile  is  z-dependent  and  a 
number  of  numerical  techniques  that  simulate  the  evolution  of  the 
propagating  wave  such  as  the  step  transition  method  [Marc70],  the 
local  normal  mode  approach  [Marc74]  and  the  beam  propagation 
method  (BPM)  [Flec76,  Feit78,  Feit79,  Feit80]  are  applicable.  We 
discuss,  in  particular,  the  BPM  technique  which  is  becoming 
increasingly  popular  for  characterizing  these  waveguide  devices. 
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3.2.  Graded-Index  Profile 

In  this  section,  we  first  solve  the  linear  diffusion  equation  in  the 
one-dimensional  case,  with  (Eq.  (2.34))  and  without  (Eq.  (2.23))  an 
applied  field,  for  the  index  profile  using  both  the  Green’s  function  and 
FDM  approaches.  The  techniques  are,  then,  extended  to  solve  the 
nonlinear  diffusion  equation  for  the  2-D  profiles. 

3.2.1.  Green’s  Function:  One  Dimensional  Case 

For  the  one-dimensional  case,  the  linear  equation  Eq.  (2.34)  is 
considered  in  this  subsection.  For  the  diffusion  without  a field,  £ is  set 
to  zero  and  Eq.  (2.34)  reduces  to  Eq.  (2.23).  Green’s  functions  are 
used  to  solve  Eq.  (2.34)  by  considering  an  impulse  input  function  at  an 
arbitrary  position  at  time  t=0  [Chen87,  Rama88a]. 

Impluse  Response 

Consider  an  arbitrary  concentration  profile  as  the  initial 
condition  NA(x)  = NA(x,t=0);  For  computational  purposes,  we  can 
discretize  the  initial  index  profile  and  sum  the  values  of  NA  at  discrete 
points  located  at  x = xif  i=0,  ±1,  ±2,  etc. 


Na(x)  = £ N£(Xi)-8(x-Xi)  , (3  d 

i 

where  6(x-xi)  is  the  delta  function.  Each  delta  function  5(x-xi)  can  be 
treated  as  a finite  source  at  x=xif  infinite  in  extent  in  both  y and  z 
directions  and  NX(xj)  represents  the  normalized  concentration  of  the 
diffusant  at  x = Xf.  Figure  3- 1(a)  shows  the  initial  concentration  profile 
represented  by  the  delta  functions  at  x=...,x_i,xo,x+i,.... 


etc.  For  each 


NA(x.t=0) 


46 


Fig.  3-1.  (a)  Initial  condition  represented  by  the  delta  functions  8(x-jf ); 

(b)  Normalized  diffusion  profile  after  time  t (dark  lines) 
composed  by  individual  gaussian  functions  A(x-xi)  (light  lines). 
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delta  function  input,  the  solution  of  Eq.  (2.34)  is  given  by  the  Green’s 
function  [Cran75], 


Zl(x-Xi,t) 


- 1 exp  [ - 

V47cmDAt  4mDAt 


(3.2) 


After  time  t,  the  finite  planar  source  diffuses  and  becomes  a Gaussian 

function  in  one  dimension,  with  its  half-width  at  1/e  point  Wi/e  = 

2YmDAt  and  the  amplitude  — = . Summing  up  all  the  impulse 

Wi/eVrc 


responses,  as  seen  from  Fig.  3- 1(b),  the  solution  of  Eq.  (2.34)  with  the 
initial  condition  NA(x)  can  be  obtained. 


NA(x,t)  = £ NA(Xi)4(x-Xi,t) 

i 


y Nj(xj) 

f Wi/eVTC 


exp[-(^M^)2] 

”l/e 


(3.3) 


As  the  number  of  discrete  points  increases,  the  summation  in  Eq. 
(3.3)  can  be  written  as  an  integral 


NA(x,t) 


NA(x')4(x-x',t)dx' . 


(3.4) 


where  A(x-x',t)  is  given  by  Eq.  (3.2).  The  initial  and  boundary 
conditions  are  given  by  Eq.  (2.24).  To  accommodate  the  diffusion  on 
both  sides  of  the  planar  source,  the  limits  of  the  integral  are  chosen 
from  x = -°°  to  +oo.  Furthermore,  for  computational  purposes,  we  will 
assume  the  melt  (x  < 0)  is  replaced  by  a virtual  host  glass  with  the 
same  diffusion  characteristics.  In  other  words,  the  host  glass  is 
assumed  to  be  infinite  in  extent  in  both  directions.  The  integration  is 
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performed  for  very  small  time  steps  t = At.  The  resulting  profile  is 
used  as  the  initial  condition  for  the  subsequent  integration.  The 
process  continues  until  the  desired  time  of  diffusion  t is  reached  by 
successive  integrations  at  intervals  At,  the  sum  of  which  makes  up  the 
time  t.  The  above  procedure  is  especially  suited  for  the  two  step 
diffusion  process,  where  the  diffusion  profile  of  the  first  step  is  used 
as  the  initial  condition  N£(x)  = NA(x,t  = 0)  for  the  second  step  using 
the  Green’s  function  method  as  described  above. 

Surface  Waveguides  with  Inifite  Source— One-Step  Process 

The  above  procedure  can  also  be  used  for  solving  the  first 
diffusion  step  with  either  a finite  or  an  infinite  source.  For  the  infinite 
source,  the  concentration  of  the  diffusants  remains  constant  at  all 
times  at  the  surface  of  the  glass.  Therefore,  we  must  assume  that  the 
boundary  condition  at  x < 0 is  N£(x,t)  = NA0;  note  that  the  value  of  NA0 
does  not  remain  the  same  near  the  surface  with  time  t > 0,  if  Eq.  (3.4) 
is  employed.  As  before,  we  divide  the  time  t into  many  intervals  At  and 
recursively  evaluate  the  integral  for  all  time  steps;  however,  we  need 
to  reset  the  boundary  condition  NA(x  < 0,t)  = NA0  for  each  evaluation 
of  the  integral  at  successive  At’s.  Thus,  the  evolution  of  the 
concentration  profile  is  continued  until  the  final  solution  is  reached. 
The  accuracy  has  been  checked  for  the  infinite-source  case  where  the 
analytical  solution  is  given  by  Eq.  (2.32).  We  use  the  same  values  of  the 
diffusion  parameters  for  both  the  analytical  and  numerical  solutions. 
We  obtained  a better  accuracy  for  £ * 0,  especially  when  the  slope  of 
the  concentration  profile  at  x = 0 is  nearly  zero  as  shown  in  Fig.  3-2  for 
the  curve  identified  as  2.  Curve  1 represents  the  case  without  the  field 


Analytic  solution 
Eq.  (2.35) 
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(x)vN 
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and  corresponds  to  an  erfc  profile.  The  solid  lines  represent  the 
analytical  result  while  the  dashed  lines  denote  the  numerical  solution. 

Buried  Waveguides — Two-Step  Process 

The  surface  waveguides  in  Fig.  3-2  are  fabricated  by  a single-step 
diffusion  process.  To  fabricate  buried  waveguide  by  exchanging  ions  in 
the  molten  salt,  a two-step  process  is  needed.  The  second  step 
process  involves  the  sample  from  the  first  step  being  diffused  in 
molten  salt  bath  containing  no  A (silver)  ions,  again  with  or  without 
the  application  of  an  external  field.  In  this  case,  the  boundary 
condition  is  Na(x  < 0,t)  = 0.  The  concentration  profile  of  the  first  step 
becomes  the  initial  condition  for  the  second  step. 

a)  Arbitrary  electric  field  E2  or  time  t2 . The  numerical  solution 
can  be  obtained  by  following  the  same  procedure  used  in  the 
calculation  of  the  first  step,  i.e.,  integrating  Eq.  (3.4)  now  with 
different  boundary  conditions.  Figure  3-3  shows  the  one-step  diffusion 
profile,  namely,  the  erfc  profile  identified  as  1,  as  well  as  the  second 
step  diffusion  profiles  with  an  arbitrary  applied  field  in  the  second 
step,  identified  by  2 through  4 in  the  figure  with  profile  1 as  the  initial 
condition.  xp  and  W are  the  peak  position  and  1/e  full  width  of  the 
resulting  profiles,  respectively.  Note  that  for  small  applied  fields  E2 
and/or  for  small  diffusion  times  t2  in  the  second  step,  NA(0,  t2)  never 
equals  zero,  but  it  may  be  close,  as  seen  in  Fig.  3-3(curve  2).  Note  that 
Na(x  < 0,t2)  = 0.  However,  with  increasing  t2  for  a given  E2  and/or  for 
large  E2,  NA(0,  t2)  becomes  zero  as  seen  from  Fig.  3-3 (curves  3 and  4). 
It  is  obvious  that  for  achieving  almost  symmetric,  buried  waveguides 
the  application  of  the  field  in  the  second  step  is  necessary. 
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Fig.  3-3.  The  one-step  and  two-step  diffusion  profiles.  In  the  first  step,  the  diffusion  time 

Ti  is  30  min  and  no  applied  field  Ei=0,  denoted  as  1.  In  the  second  step,  the  applied 
E2=  40  V/mm  and  the  diffusion  times  T2  are  15,  30  and  60  min  denoted  as  2,  3, 
and  4,  respectively. 
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b)  Large  E2  and/or  t2 . We  now  assume  that  in  the  second  step, 
the  applied  field  is  sufficiently  large  to  block  the  out-diffusion  of  ion  A 
into  the  melt.  Rewriting  the  position  x and  the  location  of  the  delta 
function  source  x'  in  terms  of  the  normalized  variables  u and  u',  the 
resulting  index  profile,  for  u > again  with  the  boundary 

condition  Na(x  < 0,  t)  = 0,  can  be  obtained  from  Eq.  (3.4)  as 


Na(u)  = j Na  ifc  ^ exp [-(u-u')2] du' , 

Jo 


(3.5) 


where  u = 


x-gA£t2 


t2 


L2 

Weff2  ’ U = Weff2  ’ T = ti*  Weff2  = 2VmDAt2  , ti  and  t2  are 

the  diffusion  time  in  the  first  and  second  steps,  respectively,  and 
f x '\fz^ 

^ A 1 Wefj2  is  concenfration  profile  of  the  single-step  process,  e.g., 

Nai(|#)  = Nao  erfc(—T=^=_)  for  e = 0. 

Weff2  2VmDAti 


3-2-2-  Green’s  Function:  2 and  3 Dimensional  Coses 

In  the  one  dimensional  case,  the  impulse  response  of  Eq.  (2.34) 
was  expressed  by  Eq.  (3.2).  The  Green’s  function  method  can  easily  be 
extended  to  2-D  and  3-D  problem.  In  the  cases  of  2-D  and  3-D 
diffusions,  the  impulse  responses  become  [Cran75] 

4(x-Xl,y-yi,t)  = 1 exp  r - (x-XrPAZxtlMy-y.-pAEyt)2  , 

47tm DAt  4mDAt  J . 


(3.6) 
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and 


4(x-Xi,y-yit 


•exp[- 


?=x,y,z 


(3.7) 


4mDAt 


respectively,  where  T.q  denotes  the  component  of  the  applied  field  in 
the  q (=  x.  y,  or  z)  direction.  The  boundary  conditions  for  2-D  or  3-D 
cases  make  the  numerical  calculation  by  the  Green’s  function  method 
very  complicated.  Since  a considerably  large  amount  of  CPU  time  is 
required  to  evaluate  the  integrals,  the  Green’s  function  approach  may 
not  be  practical.  Furthermore,  the  Green’s  function  approach  is  not 
applicable  for  solving  the  nonlinear  diffusion  equation.  The  finite 
difference  method  is  more  suitable  and  is  described  in  the  following 
section. 

3.2.3.  Finite  Difference  Method  fFDMl 

To  illustrate  FDM,  we  begin  by  considering  the  one  dimensional 
Taylor’s  expansion.  By  truncating  Taylor’s  series,  the  finite  difference 
is  used  to  approximate  the  derivatives  in  the  differential  equations.  Let 
us  employ  this  technique  to  the  one-dimensional  linear  equation  Eq. 
(2.23)  first.  The  second  derivative  of  NA  with  respect  to  x can  be 
expressed  by  the  second-order,  central  difference  method,  as 


(3.8) 


where  the  integers  i+1,  i,  and  i-1  in  the  parentheses  denote  the 
positions  at  x = xi+1,  xlt  and  x^,  respectively,  and  Ax  is  the  difference 
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in  position  between  two  adjacent  grid  points.  To  accommodate  the 
change  in  concentration  NA  with  time,  the  concentration  can  be 
written  as  NA(i,j),  where  i denotes  the  position  xi  and  j corresponds 
to  the  diffusion  time  tj.  Eq.  (3.8)  can,  therefore,  be  rewritten  in  terms 
of  NA(i,j)  as 


The  left  hand  side  of  Eq.  (2.23)  can  again  be  approximated  by  the 
central  difference  method  as 


where  the  derivative  of  the  concentration  NA  in  Eq.  (3.10)  is  evaluated 
at  t = *3+0.5  rather  than  t = tj  or  tj+i  and  identified  by  the  subscript  on 
the  left  hand  side  of  Eq.  (3.10)  as  tj+o.5-  However,  a problem  arises  due 
to  the  fact  that  the  time  t = tJ+0.5  in  Eq.  (3.10)  is  ahead  by  0.5  units 

from  that  in  Eq.  (3.9).  This  difficulty  is  addressed  in  detail  in  the 
following  subsection. 

HD  Linear  Diffusion  Equation,  without  Field 

Equation  (2.23)  describes  the  diffusion  equation  for  the  linear 
case.  As  discussed  above,  Eqs.  (3.9)  and  (3.10)  represent  the 
derivatives  at  two  different  times.  Equating  the  two  equations  with 
different  times 


a2NA 


~ NA(i+ 1 ,j)-2NA(i,j)+NA(i- 1 , j) 
(Ax)2 


(3.9) 


t k.tj+0.5  A t 


(3.10) 


(3.11) 
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may  result  in  larger  truncation  errors.  Crank  [Cran75]  resolved  this 

problem  by  rewriting  RHS  of  Eq.  (3.11)  as  an  average  of  -r-£  between 

dx2 

tj  and  tj+i: 


3Na 

~ mDA 

^2NA 

, 32Na 

1 

9 t 

Xl.tj+o.5  ^ 

^ x2 

x,,tj  ^ x2 

Xi-tj+l; 

(3.12) 


As  a result,  the  truncation  error  is  reduced  for  the  second-order 
central  difference  formula.  The  finite  difference  form  of  Eq.  (3.12)  can 
then  be  expressed  as 


NA(i,j  + l)-NA(i,j)  _ mDA 

[NA(i+l.j)-2NA(i,j)+NA(i-l,]) 

At  2 

1 (Ax)2 

. NA(i+l,j+l)-2NA(l,i+l)+NA(i-l.i  + l)l 

(Ax)2  I ,(3'13) 

where  the  order  of  total  error  Is  0((At)2+(Ax)2).  By  defining  rj  = 
mDA  At  _ 

2 (ax)2  ’ E<*‘  (3.13)  becomes 

fiNA(i- 1 ,j+l)+(- 1 -2ri)NA(i,j+l)+riNA(i+l  ,j+ 1)  = 

-riNA(i-l,j)+(-l+2ri)NA(i,j)-nNA(i+l,j)  . (3.14) 

The  left  hand  side  represents  the  time  t=tJ+i  where  NA  at  position 
x=xi_i,  xi,  and  xi+1  are  unknown,  while  the  right  hand  side  represents 
the  time  t=tj  where  NA  at  x=xn,  xit  and  xi+i  are  obtained  from  the 
calculation  in  the  previous  step.  In  the  case  of  j=0,  NA  is  given  by  the 
initial  condition  at  the  three  positions  x=Xi_i,  xif  and  xi+1.  The 
computation  width  in  the  x direction  is  discretized  say,  into  ac  points. 
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The  boundary  conditions  of  NA,  viz.,  NA(0,  j+1)  and  NA(^+1,  j+1),  at 
the  extremities  x = xo  and  x^+i  at  t=tj+i  are  usually  known.  Therefore, 

the  unknowns  are  NA(l,j+l),  NA(2,j+l) NA(3Cj+l)  and  the  total 

number  of  the  unknowns  is  % A matrix  form  representation  of  Eq. 
(3.14)  can  be  obtained  which  is  easily  solved  for  the  unknowns  by  the 
available  scientific  subroutine  library  such  as  IMSL  or  ESSL. 


Ai^NaO+I)^!  = -B(j+l)*<i  , (3.15) 

where  NA(j)T=[NA(l,j),...,NA(^j)]  , 

the  boundary  conditions  B(j+1)T  =[riNA(0,j+l),0,...,0,riNA(3C+l J+1)], 


-l-2n  n 
n -l-2n 


0 


0 


ri 


ri 

-i-2n 


A2  JQ<X 


■l+2ri  -ri 
-n  -l+2ri 

0 


0 

-ri 

-r\  -l+2ri 


and  the  superscripts  T represent  the  transpose  of  the  matrix.  Notice 
that  is  a tri-diagonal,  symmetrical  matrix.  This  method  is 

implicitly  stable  and  the  truncation  error  in  the  calculation  is 
..  At 

negligible  as  becomes  smaller  [Cran75J.  However,  the  round-off 

error  due  to  the  limited  accuracy  of  the  representation  of  a number  in 
computer  is  accumulated  in  each  time  step,  so  that  the  deviation  from 
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the  exact  solution  becomes  larger  for  smaller  - ^ 2 and  more  time 

(Ax) 

steps.  To  reduce  the  error,  we  use  ~ 10'3  at  the  beginning 

of  the  calculation  and  ~ 0. 1 during  the  final  steps  of  the  evaluation. 

1-D  Linear  Diffusion  Equation,  with  Field 

For  diffusion  with  an  applied  field,  corresponding  to  the  linear 
equation  expressed  by  Eq.  (2.34),  Eq.  (3. 12)  can  be  modified  by  adding 
one  more  term  to  include  the  drift  component: 


3Na 

~ mDA 

fa2NA 

+ a2NA 

' 

3 t 

xl.tj+0.5  2 

3 x2 

xj.tj  d x2 

xi»tj+l; 

+ 


4 

3NA 


3Na 


3N, 


l ^ X ^i+0.5.tj  ^ X Ki-0.5,tj 

3Na 


3 x 


+ 


xi+0.5>tj+l 


3 x 


xi-0.5.tj+l/ 


(3.16) 


where 


3Na 


3N/ 


_NA(i+l,j)-NA(i,i)  , NA(iJ)-NA(i-l,i) 

3 x k+o.s.tj  3 x Ki-o.5,tj  Ax  Ax 


_NA(i+l,,l)-NA(i-l,1) 

Ax 

Similarly,  Eq.  (3.13)  can  be  rewritten  for  the  field-assisted  diffusion  as 


NA(i,j  + l)-NA(i,i)  mD« 

fNA(i+ 1 ,))-2NA(i,j)+NA(i- 1,1) 

At  2 

i (Ax)2 

, NA(i+l,j  + l)-2NA(i,J+l)+NA(i-l,i+l)| 

(Ax)2  j 

^tjAg/NAd-H.jj-NAa-l.i)  . NA(l+l.l+l)-N«fl-l  i + ll) 

4 1 Ax  + ^ I-C3.17-) 
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Let 


oiDa  At 
ri=  2 (Ax)2 


and 


r2= 


At 

4 Ax  ’ 


With  these  substitutions,  Eq.  (3.17)  can  be  rewritten  in  matrix  form 
similar  to  Eq.  (3.15)  with  the  matrices  Ai,  A 2 and  the  boundary 
condition  B(j+1)  now  defined  as 

0 


Ai  = 


-i-2n  T1+T2 
ri-r2  -l-2n 


0 


ri+r2 
n-r2  -l-2n 


A2  = 


-l+2ri  -rj-r2 
-ri+r2  -l+2ri 


0 


0 


-ri-r2 
-ri+r2  -l+2n 


and 


B(J  + l)T=[(n-r2)NA(0,j+l),0,  ...  ,0,(ri+r2)NA(ac+l,j  + l)].  Again  Ai  is  a 
tri-diagonal  matrix  although  not  symmetrical.  The  finite  difference 
method  is  more  flexible  for  various  boundary  conditions  compared  to 
the  Green’s  function  method. 

2-D  Linear  Diffusion  Equation 


For  the  case  of  the  2-D  diffusion  equation,  the  concentration 
array  NaO)  is  illustrated  in  Fig.  3-4,  where  the  dots  represent  the 
mesh  points  and  total  number  of  points  is  (KK‘Ky.  The  matrix  Ai  then 
becomes  a block  tri-diagonal  matrix  with  the  size  of  by  %xXy, 

which  has  the  order  of  3C4  if  tK*=cKy=‘K,  The  size  of  the  matrix  can  be 
reduced  by  employing  the  nonuniform  meshes  in  the  two-dimensional 
case.  However,  the  CPU  time  may  increase  enormously  for  nonuniform 
meshes  for  the  same  size  of  the  matrix  Ax. 
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3Gc  points 


Xy  points 


'Na(I)  -Na(2)  -Na(3)  • 
•Na(%+1)  • 


•NaOGc) 

•Na(2%) 


I Ay 


V ' ■ Na(5Gc!^) 


Ax 

Fig.  3-4.  Two-dimensional  mesh  points.  Ax  and 
Ay  are  the  grid  sizes. 


1-D  and  2-D  Nonlinear  Diffusion  Equation 

The  implicit  finite  difference  method  as  described  above  has  the 
advantages  of  convergence  and  stability.  Yet  it  is  impossible  to  convert 
a nonlinear  differential  equation,  e.g.,  Eq.  (2.22),  into  a matrix  form 
since  the  matrix  operation  is  intrinsically  linear  in  nature.  To  solve  the 
nonlinear  diffusion  equation,  an  explicit  finite  difference  method  with 
iterations  is  introduced.  Iteration  loops  are  used  in  each  time  step  to 
achieve  accurate  results.  The  method  is  described  below. 

Considering  Eq.  (2.22),  the  concentration  Na  at  t=tj+i  can  be 
estimated  as 


at 


at 


Xi.tj+05 


Xi,tj+i 

= 1 
2 


~ d / mPA  3Na' 
3x\l-aNA  dxjhtj 


a 

1 mDA  3Na\ 

+ 3| 

( mDA  dNA\ 

Xi,tj+J 

dx 

^1-ccNa  dx  ) 

xi.tj  dx 

^l-aNA  dx  ) 

(3.18) 
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It  is  assumed  that  NA(t=tj+i)  = NA(t=tj)  for  small  time  step  At  in  the 
above  equation.  The  assumption  is  valid  for  very  small  At.  However,  the 
truncation  error  induced  in  the  calculation  of  each  time  step  may 
accumulate  and  result  in  large  error.  Therefore,  iteration  loops  are 
used  in  each  time  step  to  refine  the  results  so  that  a more  accurate 
solution  can  be  obtained.  The  concentration  NA(t=tj+i)  evaluated  in 

Eq.  (3.18)  is  used  as  an  intial  guess  to  compute  a more  accurate 
NA(t=tj+i)  by 


Xi.tj 


dx 


mPA 

1-ccNa  dx  /xi.tj+i 


. (3.19) 


The  finite  difference  forms  of  Eq.  (3.19)  are  given  by 

dNA 


at 


,NA(i,j+l)-N(i,j) 

Xi.tj +0.5  A t 


(3.20) 


a / mPA  3Na 
3x \ 1-ccNa  dx  1 


ij 

mPA  aNA| 

~ mPA 

dx 

^l-aNA  dx  ) 

Xi,tj+i  (Ax)2 

n 

-mPA  / NA(i+l,j)-NA(i.j) 

•tj  (Ax)2  U-  a[NA(i+l,j)+NA(i,j)]/2 

1 (3  21) 

1-  d[NA(i,j)+NA(i-l,j)]/2) ' (3'21) 

NA(i+l.j+l)-NA(i,i+l) 


NA(i,j+l)-NA(i-l.j+l)  | ro 
1-  a(NA(i,j+l)+NA(i-l,j+X)]/2 / ’ (3’22) 


where  NX  is  the  guessed  concentration  at  t=tJ+i  and  nA  is  the 
unknown  concentration  at  t=tj+i.  The  procedure  is  repeated  by 
substituhng  the  calculated  nA  into  NX  in  Eq.  (3.22)  until  steady  state 
values  of  NA  are  attained.  However,  great  care  must  be  exercised  in 
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the  case  of  large  ri= 


mDAAt 
2 (Ax) 2 


which  is  primarily  determined  by  the 


time  step  At.  As  At  increases,  the  number  of  iterations  increases  and 
the  steady  state  values  of  NA  may  deviate  from  the  exact  values  because 
of  the  accumulation  of  the  round-off  error.  The  solution  may  not 
converge  for  large  ri  if  aNA  is  close  to  one.  The  steady  state  values  for 
each  time  step  can  be  reached  more  quickly  by  using  a smaller  At  with 
less  iterations,  implying  that  more  time  steps  are  required  for  a fixed 
diffusion  time.  The  round-off  error  may  also  accumulate  if  many  time 
steps  are  involved  in  the  calculation. 

Figure  3-5  shows  the  1-D  concentration  profiles  modeled  by  an 
explicit  finite  difference  method  with  a=0.0,  0.5,  0.8,  and  0.9.  For  the 
sake  of  illustration,  DA  is  assumed  to  be  0.015  pm2/min  corresponding 
to  that  of  the  Ag+-Na+  exchange  in  BK7  glass  at  340°C.  As  expected, 
the  profile  is  a complementary  error  function  for  a=0.  As  a increases, 
the  profiles  becomes  more  and  more  gaussian-like.  The  slope  near  the 
surface  (x=0)  gets  smaller  as  a becomes  larger.  Figure  3-6(a)  shows  a 
surface  plot  of  a 2-D  concentration  profile  for  t=720  min,  DA=  0.0042 
pm2/min,  cc=0.9  and  the  mask  width  WM=3.5  pm.  Figure  3-6(b)  shows 
the  contour  equi-concentration  profile  in  the  transverse  dimension. 
The  1/e  width  and  depth  are  7.3  pm  and  3.3  pm,  respectively.  It  is 
obvious  that  the  aspect  ratio  of  the  waveguide  is  larger  than  two.  The 
ratio  determines  the  waveguide  characteristics  as  will  be  explained  in 
chapter  5.  The  2-D  diffusion  is  performed  by  allowing  the  ion 
exchange  to  occur  via  a mask  opening  in  the  glass  substrate  delineated 


Fig.  3-5 
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ovjst/(x)vn 


Fig.  3-5  Concentration  profile  solved  by  FDM  for  a = O.O(erfc),  0.5,  0.8,  and  0.9.  The  diffusion 
coefficient  and  time  are  0.015  pm2/ min  and  720  min,  respectively.  A Gaussian  curve 
is  drawn  for  comparison  with  the  dotted  curve  (oc=0.9). 
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Fig.  3-6.  3-D  concentration  profiles  for  t=720  min.  DA=0.0042  pm2/min, 
a=0.9,  and  ^%=3.5  pm.  (a)  Surface  plot,  (b)  Contour  plot. 
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through  a standard  photolithographic  process  described  in  the  next 
chapter.  The  mask  opening  region  always  has  a constant  concentration 
as  seen  from  the  surface  profile  by  the  flat  top.  As  the  mask  opening 
becomes  narrower,  the  width  of  the  flat  top  decreases  and  the  contour 
lines  are  expected  to  become  semicircles.  In  this  case,  the  aspect 
ratio  of  the  waveguide  becomes  two. 

3.3.  Guided  Wave — Helmholtz  Equation 

It  is  well  known  that  the  optical  wave  is  essentially  confined  in 
the  transverse  directions  and  guided  along  the  longitudinal  direction. 
The  change  in  the  index  profile  is  assumed  to  be  small  and 
proportional  to  the  concentration  profile  in  the  case  of  ion-exchanged 
waveguides.  The  index  profiles  are  obtained  numerically  as  described 
in  the  last  section.  The  knowledge  of  the  behavior  of  the  guided  wave 
is  crucial  to  the  design  of  optical  devices.  Two  approaches  are  usually 
used  to  solve  the  modal  properties  of  one-dimensional  waveguides:  one 
is  geometrical  optics  and  the  other  is  wave  optics.  The  former,  i.e.,  the 
ray  optics,  is  an  approximation  that  is  valid  as  long  as  the  waveguide 
dimensions  are  larger  than  the  wavelength  of  the  light.  The  latter  or 
the  wave  optics  is  always  valid  and  is  particularly  used  to  determine 
the  modal  field  distribution;  nevertheless,  it  is  more  complicated  than 
the  ray  optics.  Since  the  concentration  profile  of  the  dopant  ions  is 
continuously  changed  from  maximum  to  zero,  the  index  in  the 
waveguide  region  is  inhomogeneous  and  graded.  Hence,  we  will  focus 
on  the  graded-index  waveguide  and  emphasize  the  field  distribution  of 
the  guided  waves.  However,  the  WKB  method,  which  adopts  the  ray 
approach,  will  be  discussed  briefly  in  this  chapter  since  the  method  is 
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useful  in  the  characterization  of  the  mode  indices  of  the  planar 
multimode  waveguide. 

3.3.1.  Vector  and  Scalar  Helmholtz  Equation 

The  wave  propagation  in  a source-free,  linear  dielectric  medium 
is  governed  by  the  vector  Helmholtz  (or  the  wave)  equations.  Assuming 
a z-propagating  wave  with  time-  and  z-dependence  expressed  as 

exp(jcot-jpz)  and  e,  |i0  and  (3  independent  of  z,  the  wave  equations 
become  [Koge88] 

VtEt  + Vt |Et  • Vt  In  e)  + (co28jj.0-f32)  Et  = 6 , (3.23) 

and  Vt2Ht  + (Vt  In  e)  x (Vt  x Ht)  + (co2ep0-P2)  Ht  = 6 , (3.24) 

where  Et  and  Ht  are  the  transverse  components  of  the  electric  and 
magnetic  fields  of  the  waveguide  modes,  respectively,  and  depend  on 
the  transverse  coordinates  only;  co  is  the  angular  frequency;  e and  |i0 
are  the  scalar  permittivity  and  the  permeability  of  the  medium,  and  p 
is  the  propagation  constant.  The  transverse  Laplacian  is  defined  as  Vt2 
= dx2  + 9y2  • Note  that  Et  and  Ht  are  independent  of  time  and  z.  The 
corresponding  z-components  of  the  fields  are,  derived  from  V H = 0 

and  V (eE)  = 0 and  given,  in  terms  of  the  transverse  components,  as 
follows; 


jf3Ez  = VfEt  + Et  • Vt  In  e 


(3.25) 


and 


JPHZ  = Vt  Ht . 


(3.26) 


Similarly,  Ez  and  Hz  are  independent  of  time  and  z. 
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For  a slowly  varying  index  medium,  Vt  In  e is  approximately  equal 
to  zero  and  the  gradient  terms  (second  term)  in  the  Eq.  (3.23)  and 


Eq.  (3.24)  can  be  ignored.  The  above  four  equations  are 

rewritten  as 

V?Et  + (co2ep0-p2)  Et  = 0 , 

(3.27) 

V?Ht  + ((02ejj.o-p2)  Ht  = 0 , 

(3.28) 

jpEz  = Vt  Et , 

(3.29) 

and 

X 

N 

II 

< 

r+ 

(3.30) 

In  principle,  we  only  need  to  solve  the  above  equations 

for  either  Et  or 

Ht.  The  other  field  vector  can  be  calculated  by  using  Maxwell’s  curl 

equations 

V x E = -jcopoH 

(3.31) 

and 

V x H = jcoeE  . 

(3.32) 

The  transverse  components  in  x and  y directions  are  easily  decoupled 
using  separation  of  variables.  As  a result,  Eq.  (3.27)  and  Eq.  (3.28) 

become 

fVtEx  + (w2ep0-p2)  Ex  = 0 
|Vt2Ey  + (a)2sp0-p2)  Ey  = 0 

(3.33) 

V?HX  + (co2ep0-p2)  Hx  = 0 
Vt2Hy  + (co28p0-P2)  Hy  = 0 • 

(3.34) 

For  a two-dimensional  surface  waveguide,  with  the  y-direction  parallel 
to  the  surface  of  the  substrate  Ex/Ey  is  close  to,  but  not  equal  to,  zero 
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for  the  quasi-TE  modes  and  similarly,  Hx/Hy  is  nearly  equal  to  zero  for 

the  quasi-TM  modes.  Thus,  the  scalar  Helmholtz  equation  is  expressed 
as 

V?F  + (co2ep0-p2)  F = 0 , (3.35) 


where  F represents  Ey  for  the  quasi-TE  mode  or  Hy  for  the  quasi-TM 

mode.  In  the  above  equation,  F and  e are  functions  of  x and  y and  e is 

assumed  to  be  a slowly  varying  function  of  x and  y.  By  substituting  Vt2  = 
3^  32 

9^2  + scalar  Helmholtz  equation  becomes 


d2F  32F 
dx2  dy2 


+ k§(n2-N2)F  = 0 , 


(3.36) 


where  n is  the  index  of  refraction;  the  mode  or  the  effective  index  N = 
P/k0  and  k^  = co2e0p0.  in  the  case  of  one-dimensional  waveguide,  the 
above  equation  can  be  further  simplified  as 

+ k§(n2-N2)F  = 0 . (3.37) 

axz 

In  the  following  sections,  numerical  methods  will  be  presented  to 
solve  Eq.  (3.36)  for  2-D  waveguides  and  Eq.  (3.37)  for  1-D  waveguides. 

3.3.2.  One-Dimensional  Waveguides 

One-dimensional  (planar)  waveguides  are  assumed  to  confine  the 
energy  in  the  x-direction.  The  study  of  multimoded,  planar  waveguides 
helps  us  characterize  the  index  profile  and  correlate  it  to  the  diffusion 
profile.  Knowledge  of  the  index  profile  is  essential  to  the  design  and 
fabrication  of  the  waveguides. 
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WKB  Method 

The  well-known  WKB  method,  used  to  obtain  approximate 
solutions  of  the  Schrodinger  equation,  can  be  applied  to  Eq.  (3.37)  to 
solve  for  the  modes  in  a multimode  waveguide  with  slowly  varying 
index  profile  n(x).  Once  the  mode  indices  (N)  in  the  wave  equation  are 
solved,  Eq.  (3.37)  becomes  an  ordinary  differential  equation  and  the 
fields  can  be  obtained  by  solving  the  equation  using  a numerical 
technique,  such  as  the  Runge-Kutta  method. 

According  to  the  WKB  method,  the  characteristic  equation  for 
the  mth-order  mode  is  given  by  [Gord66] 

fXm  1 /o 

2kc  I [n2(x)-Nm]1/2dx  - 2<|>b  - 2<j)c  = 2mn,  (3.38) 

JO 

where  xm  is  the  turning  point  for  the  mth  mode;  -2<t>b  and  -2<J>C  are  the 
phase  shifts  at  the  turning  point  and  at  the  cover-guide  interface.  It  is 
assumed  that  the  waveguide  is  a graded-index  waveguide  with  its 
cover-guide  interface  at  x=0.  In  the  ray  picture  (Fig.  3-7),  the  left  hand 
side  of  Eq.  (3.38)  represents  the  total  transverse  phase  shift  the  ray 
suffers  in  one  round  trip  across  the  guide.  This  must  equal  2m7t  to 
achieve  constructive  interference.  At  the  turning  point  xm,  the  ray 
reverses  its  direction  and  travels  back  to  the  surface  of  the  waveguide; 
the  index  at  that  point  equals  the  mode  index,  i.e., 

Nm  = n(xm)  . (3.39) 

Figure  3-7  shows  the  graded-index  profile  of  the  waveguide  on  the  left 


Total  Internal 
Reflection 

N(XnJ=Nm  ns  (j)c~  7^/2  Air 
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X, 

C 


Fig.  3-7.  Ray  diagram  for  the  graded-index  waveguides. 
The  index  profile  is  shown  on  the  left.  'Hie 
notations  are  defined  in  the  text. 
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and  the  curved  path  traversed  by  the  ray  in  the  waveguide  on  the  right. 
At  the  cover-guide  interface,  again  it  is  reflected  back  due  to  total 
internal  reflection.  The  phase  shift  0C  at  the  cover  -glass  interface 
equals 


where  nc  and  ns  are  the  cover  index  and  the  surface  index, 
respectively,  and  r=l  for  the  TE  mode  and  r = nl/ng>  i for  the  TM 
mode.  <t>c  for  the  TM  mode  is  always  larger  than  that  for  the  TE  mode. 
If  the  mode  index  Nm  is  close  to  ns,  then  0C  s jt/2.  In  the  case  of  highly 
asymmetrical  waveguides,  i.e.,  nb-nc»ns-nb,  0C  is  also  approximately 
equal  to  k/2,  where  nb  is  the  substrate  index.  0b  approximately  equals 
k/4  [Gord66]  as  the  ray  approaches  grazing  incidence  at  x = xm  with 
the  index  approaching  the  mode  index.  However,  this  approximation 
may  give  erroneous  results  for  the  modes  close  to  cutoff  [Sriv87a]. 
Since  Nm  is  the  only  unknown,  Eq.  (3.38)  can  be  solved  numerically 
for  a given  index  profile.  The  accuracy  of  the  this  method  depends  on 
how  accurate  we  can  estimate  the  phase  shifts  0C  and  0b  if  no 
approximation  is  used.  For  the  three-layer  slab,  step-index  waveguides, 
the  approximations  of  0b  = ft/ 4 and  0C  = it/ 2 are  not  accurate  enough  and 

the  exact  expressions  for  the  phase  shifts  must  be  used.  Thus,  Eq. 
(3.38)  becomes 


(3.40) 


2k0W  Vn|-Nm  - 20b  - 20c  = 2mn, 


(3.41) 


where  W is  the  width  of  the  waveguide;  0C  is  expressed  in  Eq.  (3.40) 
and  0b  is  obtained  by  substituting  0b  and  nb  with  0C  and  nc  in  Eq. 
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(3.40),  respectively.  No  analytical  solutions  for  the  above  equation  are 
available.  However,  Eq.  (3.41)  can  be  normalized  [Koge74]  as 

Vfl-5  = mn  + <()b  + <|>c  , (3.42) 

where  the  normalized  frequency  V = k0wV  ns2-nb2  ; the  normalized 
mode  index  b h ^ tan'll +qM)^^  ; <|>c  = 

1+(1m ...  / a+b  nb2-nr2 

tan  (Ha qM)  V Hb  : ^ asymmetry  a = 2 2 and  qM=  0 for  the  TE 

ns  nb 

, rig 

mode  and  qM=  2 "1  for  the  TM  mode.  The  asymmetry  a is  zero  for 
nb 

the  symmetrical  waveguides  (nb  = nc).  In  general,  a is  large  for  air- 
covered  and/ or  small-index-change  guides.  Figure  3-8  shows  b versus 
V for  the  TE  modes  for  a=0,  1,  10,  and  °°.  The  normalized  mode  index 
b is  zero  for  the  mode  close  to  cutoff  and  is  also  zero,  b approaches 
unity  for  the  well-confined  mode  and  both  (j)^  and  <j)c  are  near  7t/2. 
There  is  no  cutoff  for  the  fundamental  mode  (m=0)  in  the  case  of  a=0. 
The  mode  index  Nm  of  the  TE  mode  is  always  larger  than  that  of  the 
TM  mode  with  the  same  mode  number  m in  the  case  of  an  isotropic 
medium.  However,  the  mode  indices  are  the  identical  for  the  TE 
modes  and  the  TM  modes  in  the  case  ns->nb  and  nc*nb  since  qM 

approaches  zero  for  the  TM  modes. 

Matrix  Method 

Equation  (3.42)  can  be  used  to  obtain  the  mode  index  of  a three- 
layer  slab  waveguide  which  has  the  simplest  structure:  the  substrate, 
film  and  cover  layers.  In  this  subsection  we  consider  multilayer  slab 
waveguides  which  can  be  5-layer  directional  couplers  or  discretized 
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Fig.  3-8.  The  normalized  mode  index  (b)  versus  the  normalized  frequency  0 
characteristics  of  3-layer  step-index  waveguides  for  mode  m = 0, 

2,  and  3.  The  asymmetry  factors  are  0,  1,  10,  and  «,  respectively, 
from  left  to  right  for  each  mode. 
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graded-index  waveguides.  2X2  matrix  method  will  be  employed  here 
to  solve  the  mode  indices  as  well  as  the  mode  fields  of  a multilayer 
waveguide.  This  method  can  be  used  to  solve  not  only  the  multilayer 
waveguides  but  also  the  multilayer  transmission  and  reflection 
problems. 

For  the  TE  modes  of  a planer  waveguide,  the  components  Ez,  Ex, 
and  Hy  are  zero.  The  coordinate  system  is  defined  in  Fig.  3-9  which 
shows  a sketch  of  a multilayer  waveguide,  where  superscripts  *+’  and 
represent  the  positive  and  negative  traveling  wave,  respectively;  Ei 
and  nA  are  the  amplitudes  of  the  field  and  the  index  in  layer  i.  d2,  d3>... 

are  the  thicknesses  associated  with  layer  2,  3 In  this  case,  Eq. 

(3.37)  becomes 


d% 

dx2 


+ k§(n2(x)-N2)Ey  = 0 . 


(3.43) 


Ey  is  the  transverse  electric  field.  The  H fields,  in  terms  of  Ey,  are 
given  by 

H - _ P p 

x"  COllo^7'  (3.44) 

IT  j dEy 

and  Z ~ wp0  dx  ’ (3.45) 

where  p is  the  propagation  constant.  The  solution  of  Ey  in  layer  i can 
be  expressed  as 


Ey (layer  i)  = E f e~Jk>x  + El  eJkiX, 


< x ^ a',  = £ 


m=2 


dm 


(3.46) 
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Fig.  3-9  Multilayer  structure  of  waveguide. 


where  ki  is  defined  as  k0V n^-N2  . The  first  term  of  RHS  of  the  above 

equation  is  a positive-x  propagating  wave  and  the  second  term  is  a 

negative-x  wave.  The  solutions  of  Eq.  (3.43)  are  oscillating  fields  for  m 

> N.  On  the  other  hand,  the  solutions  become  exponential  functions 

for  ni  < N and  imaginary  ki.  The  boundary  conditions  require  that  the 

dE 

tangential  components  Ey  and  Hz  (or  — in  Eq.  (3.45))  be  continuous 

at  the  interface  of  each  layer.  Therefore,  at  the  (i-l)th  interface  the 
field  must  be 

Ey(x=dj.j)  = Eh  e'Jki-ldM  + Ej.  i glki-idu 
= EJ  e'Jkid>-i  + El  eJkid>-i 


(3.47) 
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-£0|a.oHz(x-di.1)  — -ki-iEf-i  e'Jkl-ldi-i  + ki_ lEj-i  eJ^*-idn 

= -kjEf  e'Jkdi-i  + kjE'i  dkldi-i  . (3.48) 


From  Eqs.  (3.47)  and  (3.48),  we  can  obtain 


where 


Mi  = l 
2 


/ 

fl  + k. 

ki_i 

| gjfki-i-kiJdf 

i 

rH 

gjlki-i+kiJd,.!  ' 

1 

i-k> 

p-jlki-i+kildf  | 

i+k> 

| p-Kki-i-kiJdf 

V 

ki-i/ 

ki-ij 

1 / 

(3.49) 


(3.50) 


For  a structure  consisting  of  layers. 


(3.51) 


A 2x2  matrix  M correlates  the  fields  between  the  first  layer  and  the 
last  layer.  The  matrix  is  determined  by  the  parameters  of  the 
waveguide,  i.e„  the  indices  and  the  thickness,  and  independent  of  the 
fields.  In  the  case  of  the  waveguide  problem,  the  fields  must  be 
confined  in  the  guiding  layers  and  exponentially  decaying  in  the  outer 
layers.  Therefore,  the  amplitudes  Ef  and  E^+  are  zero.  To  solve  for  the 
mode  index,  an  arbitrary  value  of  B*  is  substituted  into  Eq.  (3.51).  Ex+ 
and  Ef  can  be  evaluated  by  choosing  different  trial  mode  indices  until 
Ef  is  zero  for  a trial  mode  index.  The  trial  value  is  one  of  the  mode 
indices  of  the  waveguides.  The  mode  field  is  obtained  by  substituting 

all  the  calculated  amplitudes  Ex E*  into  Eq.  (3.46).  The  matrix 

method  is  also  useful  for  solving  for  the  fields  of  the  radiation  modes. 
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In  the  case  of  complex-index  systems,  i.e.,  lossy  or  gain  media,  the 
method  is  applicable  as  well.  However,  the  mode  index  may  also  be  a 
complex  number  in  this  case. 

For  the  TM  modes,  the  components  Hz,  Hx,  and  Ey  are  zero.  The 
scalar  Helmholtz  equation  becomes 

f]2rf 

-r~f  + k§(n2(x)-N2)Hy  = 0 (3.52) 

ax2 


and  the  E fields  are  given  by 


and 


Ev  = 


Ez  = 


: P Hy 
(D£  y 

(3.53) 

j dHy 
©e  dx  • 

(3.54) 

Similar  expression  for  the  adjacent  H fields  can  be  obtained 


(Hf.i 

\Hj_i 


= Mi 


(3.55) 


where 


/ 

1 +ni-lki|  gjJki-i-kiJdM 
n?ki_i/ 

fl.nf.iki) 
l n?ki_i) 

gjlki-i+kiJdj.! 

1 _ ni-lki  \ p-j(kj-i+ki)dj.i  1 

'i+shk' 

n?ki.i/ 

nfki.ii 

1 / 

(3.56) 


Since  the  tangential  components  Hy  and  Ez  are  continuous,  according 
to  Eq.  (3.54),  is  continuous.  As  a result,  the  slope  of  Hy  is  not 

continuous  if  the  index  is  discontinuous. 

The  matrix  method  is  usually  more  accurate  than  the  WKB 
method  for  the  TE  modes  as  well  as  for  the  TM  modes.  The  mode 
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fields  are  easily  obtained  by  the  matrix  method.  This  method  is 
applicable  to  both  the  step-index  and  the  graded-index  waveguides  if 
the  step  indices  are  used  to  approximate  the  discretized  graded-index 
profile.  As  mentioned  earlier,  it  can  be  used  to  solve  for  the  modes  in 
complex-index  or  lossy  waveguides.  However,  this  method  may  need 
more  CPU  time. 


Finite  Difference  Method 

As  described  earlier,  the  nonlinear  diffusion  equation  can  be 
converted  into  a finite  difference  form  and  solved  by  iterations. 
Similarly,  the  scalar  Helmholtz  equation  can  be  converted  into  a finite 
difference  expression.  However,  the  Helmholtz  equation  involves  the 
following  unknowns:  the  propagation  constant  or  the  mode  index  as 
well  as  the  mode  field  as  shown  in  Eq.  (3.37).  The  mode  indices  have 
discrete  values  and  each  of  them  corresponds  to  a mode  field.  In  this 
subsection,  the  application  of  the  finite  difference  formula  to  the 
Helmholtz  equation  will  be  described  and  compared  to  the  methods 
mentioned  in  the  last  subsection. 

Let  us  consider  the  first  term  in  Eq.  (3.37),  i.e.,  d2F/dx2,  which 
can  be  represented  in  the  form  of  Eq.  (3.8).  By  rewriting  Eq.  (3.37), 
one  can  obtain  the  expression  at  x=xj: 


F(i+l)-2F(i)+F(i-l) 

(Ax)2 


+ kg(n2(i)-N2)F(i)  = 0 


Fi  + ^F1+1  = b Ft , 
v2 


(3.57) 


or 


(3.58) 
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where  the  field  Fi=F(i)=F(x=xi);  v2=k02(ns2-nb2)(Ax)2;  fi E-2'nb2  ; the 

ns  -nb 

N2  n 2 

index  ni=n(i)=n(x=xi)  and  g ^2  • ns.  nb  and  b,  as  defined  earlier, 

ns  -nb 

are  the  maximum  index,  the  substrate  index  and  the  normalized  mode 
index,  respectively.  It  is  known  that  the  guided  field  is  zero  away  from 
the  waveguide  region.  Therefore,  it  is  reasonable  to  assume  that  Fo=0 
and  F^+1=0  at  the  calculation  boundaries,  as  shown  in  Fig.  3-10,  where 
‘K.  is  the  number  of  the  discretized  points  of  the  field.  For  i=l  to  % Eq. 
(3.58)  can  be  expressed  as 


1 

0 

S? 

1  

~Fi~ 

"Fi" 

s dz  ■ 

f2 

f2 

. . . 

• 

= b 

• 

• • s 

• 

1 

0 

1 

-Fat. 

-Fat. 

(3.59) 


where  s = ^ and  <£=fi-^2  . For  the  nontrivial  solutions  of  the  field,  the 


normalized  mode  index  b must  be  an  eigenvalue  of  the  tri-diagonal 
matrix  which  is  shown  in  Eq.  (3.59).  The  eigenvectors  of  this  matrix 
are  the  mode  fields  of  the  waveguide.  The  eigenvalues  and  vectors  can 
be  solved  by  commercially  available  subroutines  e.g.,  IMSL.  All  the 
positive  eigenvalues  are  the  b values  of  the  guided  modes  and  the 
maximum  eigenvalue  and  eigenvector  correspond  to  the  b value  and 
the  field  of  the  fundamental  mode,  respectively. 

The  finite  difference  method  can  yield  all  the  mode  indices  as 
well  as  the  mode  fields.  Since  we  assume  that  the  fields  vanish  at  the 
calculation  boundaries,  only  the  guided  modes  can  be  solved  by  this 
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Fig.  3-10.  1-D  field  profile  with  the  assumption  F0  and  F5 
using  in  FDM. 
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method.  Attention  must  be  given  to  calculation  of  the  modes  close  to 
cutoff,  where  the  fields  may  extend  away  from  the  guided  region.  In 
this  case,  more  points  and  larger  calculation  boundaries  must  be 
taken.  Compared  to  the  matrix  method,  this  method  can  not  solve  the 
TE  and  TM  modes  without  modifications  of  Eq.  (3.59).  For  the  step- 
index  waveguides  with  a couple  of  layers,  the  number  of  points  can  not 
be  reduced  as  in  the  case  solved  by  the  matrix  method.  However,  for 
the  graded-index  waveguides  with  real  indices,  the  finite  difference 
method  can  give  quick  results  for  all  the  modes.  The  implementation 
of  the  computer  program  for  the  finite  difference  method  is  quite  easy 
as  long  as  the  scientific  subroutine  package  is  accessible.  The 
application  of  this  method  to  the  two-dimensional  problems  is 
somewhat  straightforward  compared  to  the  WKB  and  matrix  methods 
as  will  be  explained  in  the  next  section. 

3.3.2.  Two-Dimensional  Numerical  Method 

Most  guided-wave  optical  devices  consist  of  two-dimensional 
channel  waveguides.  Only  a few  special  2-D  index  profiles  have  explicit 
analytic  solutions,  e.g.,  step-index  and  parabolic-index  circular  optical 
fibers  [Marc79].  Thus,  numerical  methods  are  very  important  for  an 
understanding  of  2-D  guided-wave  behavior  in  rectangular  geometry. 

Two  methods  are  introduced  in  this  section.  The  first  is  the 
effective  index  method  and  the  second  is  the  finite  difference  method. 

Again,  the  emphasis  is  put  on  the  scalar  Helmholtz  equation  for  the 
two-dimensional  case,  i.e.,  Eq.  (3.36). 
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Effective  Index  Meth nri 

The  effective  index  method  [Rama74,  Hock77,  Chia86,  Veld88] 
is  popular  and  easy  to  implement  for  the  2-D  channel  waveguides.  This 
method  is  mainly  used  to  calculate  the  mode  indices  of  the 
waveguides.  By  converting  a 2-D  index  profile  into  its  1-D  equivalent 
profiles,  the  mode  index  can  easily  be  evaluated  since  the  numerical 
methods  for  the  1-D  planar  waveguides  are  available  as  discussed  in 
the  last  section.  In  general,  solving  the  1-D  equivalent  index  profiles  is 
much  easier  than  solving  the  2-D  index  profile  directly. 

The  method  is  briefly  described  below  [Veld88].  First,  we 
consider  the  scalar  Helmholtz  equation,  Eq.  (3.36).  Assume  that  the 
modal  field  F(x,y)  can  be  written  as 


F(x,y)  = Fi(x,y)  F2(y)  , 


(3.60) 


where  F^x.y)  is  a slowly  varying  function  of  y so  that 


An  effective  index  profile  neff(y)  is  defined  to  satisfy 
-fat + ko[n2(x,y)  - n2n(y)]  Fi  = 0 . 


(3.61) 


(3.62) 


As  shown  in  Fig.  3-11,  the  channel  waveguide  is  sliced  vertically  in 
the  depth  direction  normal  to  the  surface  of  the  sample  with  either 
uniform  or  nonuniform  slice  thickness.  The  thickness  of  the  slice 
should  be  small  enough  to  accommodate  index  variations.  The 
thickness  can  be  judicially  chosen  with  its  value  increasing 
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Fig.  3-11.  The  procedure  in  the  effective  index  method  for 
evaluating  the  mode  index  of  2-D  waveguides. 
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monotonically  with  the  distance  between  the  slice  and  the  center  of 
the  waveguide.  Since  there  is  no  sharp  boundary  in  the  y-direction  for 
the  ion-exchanged  channels,  the  location  of  the  slices  is  not  that 
critical.  The  nonuniform  spacing  approach  uses  less  memory  and  aids 
in  saving  computer  CPU  time.  Each  sliced  layer  has  its  own  mode 
index  neff(y)  and  modal  field  Fi(x,y).  neff{y)  is  evaluated  for  each  layer 
by  assuming  that,  within  the  layer,  the  index  is  invariant  in  the  y- 
direction  and,  hence,  only  a 1-D  index  profile  is  considered  in  solving 
Eq.  (3.62).  To  solve  for  the  mode  index  of  each  slice  in  Eq.  (3.62),  we 
utilize  the  1-D  matrix  method  in  which  only  a 2X2  matrix  is  involved. 
Hence,  the  1-D  matrix  method  is  suitable  for  a memory-limited 
calculation  environment,  e.g.,  a personal  computer.  Once  the  effective 
index  profile  neff(y)  is  solved,  by  substituting  Eqs.  (3.60),  (3.61),  and 

(3.62)  into  Eq.  (3.36),  the  2-D  Helmholtz  equation  is  reduced  to  a 1-D 
Helmholtz  equation: 

+ k§[n?ff(y)  - N2]  F2  = 0 . (3.63) 

Thus,  the  mode  index  N of  a 2-D  index  profile  can  be  solved  by  solving 
the  equivalent,  1-D  graded-index  profile,  namely,  nefffy)  by  again  using 

the  1-D  matrix  method.  Although  the  effective  index  method  is 
suitable  for  the  fundamental  mode  since  its  use  is  restricted  by  the 
assumption  of  a slowly  varying  field  Fi(x,y)  in  the  derivation  of  the 
algorithm,  we  have  verified  its  validity  for  both  the  fundamental  mode 
of  channel  waveguides  and  the  first  two  normal  modes  of  the  weakly 
coupled  directional  couplers.  Nevertheless,  the  effective  index 
approach  is  known  to  fail  near  the  cutoff  [Chia86]. 
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A quasi-TE  mode  of  a channel  waveguide  has  its  polarization 
parallel  to  the  surface  of  the  guide  (y-direction).  Therefore,  in  this 
case,  Fi  in  Eq.  (3.62)  represents  the  transverse  E field.  However,  in 
order  to  maintain  the  polarization  in  the  same  direction,  a TM  mode  is 
used  in  the  solution  of  Eq.  (3.63),  where  F2  is  assumed  to  represent 
the  transverse  H field.  On  the  other  hand,  if  we  had  considered 
instead,  the  TM  mode  in  Eq.  (3.62),  F2  in  Eq.  (3.63)  would  now 
represent  the  transverse  E field  corresponding  to  the  TE  mode  in  the 
equivalent,  planar,  graded-index  waveguide  in  the  y-direction.  Our 
numerical  results  (presented  later  in  chapter  6)  indicate  that  there  is 
little  difference  in  the  TE  and  the  TM  mode  indices  due  to  the 

asymmetry  of  the  channel  waveguide  between  the  TE  and  the  TM 
modes. 

Finite  Difference  Method 

The  1-D  finite  difference  method  can  be  extended  to  the  2-D 
finite  difference  method  by  defining  mesh  points  similar  to  those 
shown  in  Fig.  3-4.  In  this  case,  Eq.  (3.59)  for  1-D  waveguides  becomes 

A^xa^  F Wxi  = b F%3^x1  , (3.64) 

where  A is  a block  tri-diagonal  symmetrical  matrix  and  equals 
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boundaries  have  been  assumed  to  be  zero.  Since  the  matrix  A is  band 
symmetrical,  we  do  not  need  to  solve  for  all  the  eigen-solutions 
because  commercial  subroutines  are  available  to  obtain  a couple  of 
largest  eigen-values  and  vectors,  which  are  associated  to  the  lower 
order  modes.  Thus,  less  CPU  time  and  memory  are  needed.  Even 
though  the  CPU  time  is  reduced  by  using  a symmetrical  matrix,  it  still 
takes  about  200  seconds  CPU  time  in  the  IBM  3090-600J  computer 
with  vector  processors  for  3Gc=%=51,  which  produces  a 2600  by  2600 
matrix.  If  nonuniform  mesh  sizes  are  used  in  the  2-D  finite  difference 
method,  the  CPU  time  may  become  an  order  of  magnitude  larger  and 

actual  run  time  may  be  much  longer  due  to  the  lower  priority  of  the 
job. 
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Compared  to  the  effective  index  method,  the  2-D  finite 
difference  method  is  more  accurate  as  long  as  enough  points  are 
taken,  which  may  result  in  the  need  for  a supercomputer.  The 
algorithm  described  above  does  not  distinguish  betwwen  the 
polarizations  of  the  modes.  Therefore,  it  is  not  suitable  to  simulate 
devices  which  are  birefringent.  If  the  mode  is  close  to  cutoff,  the 
calculation  boundaries  and  the  number  of  points  must  be  increased  to 
maintain  the  same  accuracy. 

3.4.  Propagating  Wave — Beam  Propagation  Method 

The  beam  propagation  method  (BPM)  is  a powerful  tool  for 
dealing  with  the  propagation  of  optical  beam  through  a medium  with  a 
small  index  variation.  This  method,  developed  by  Fleck  et  al  [Flec76] 
in  1976,  has  found  wide  applications  to  the  simulation  of  optical 
guided-wave  devices,  e.g.,  fibers  [Feit78,  Feit79,  Feit80],  tapered 
waveguides  [Baet82],  Y-branches  [Baet82],  directional  coupler  [Feit83], 
cross  couplers  [Neye85],  modulators  [Dani84],  nonlinear  directional 
couplers  [Thyl86],  grating  devices  [Yevi82]  and  lasers  [Yevi86,  Kacz88, 
Fede89].  With  only  a few  assumptions  [Vanr81],  BPM  shows  its 
flexibility  and  precision  in  modeling  almost  all  kinds  of  propagation 
problems.  The  accuracy  of  BPM  has  been  demonstrated  by  simulating 
the  propagation  of  an  eigenmode  in  a straight  waveguide.  Furthermore, 
it  is  not  necessary  to  solve  for  the  mode  field  or  the  mode  index  of  the 
waveguide  in  the  evolution  of  a guided-wave  by  BPM.  However,  this 
method  provides  numerical  results,  not  analytical  solutions.  On  the 
other  hand,  in  some  devices,  e.g.,  the  directional  couplers,  the 
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coupled-mode  theory  (CMT)  [Mill54,  Yari73]  may  give  analytical 
expressions  to  predict  the  device  characteristics.  But  the  accuracy  of 
the  prediction  depends  on  the  validity  of  some  assumptions  such  as 
the  weak  coupling  in  the  coupled-mode  theory,  it  is  not  applicable  to 
strongly  coupled  waveguides.  Device  simulation  can  also  be  done  by 
evaluating  the  local  normal  mode  coupling  at  several  steps  along  the 
longitudinal  direction  [Burn88a,  Bum88b,  Burn89].  The  disadvantage 

of  this  method  is  the  need  to  calculate  the  normal  mode  fields  at  each 
step. 

This  section  will  present  the  theory  of  BPM,  the  numerical 
implementation  and  its  application  to  a 3-dB  asymmetrical  cross 
coupler.  Emphasis  will  be  placed  on  the  propagation  in  2-D  structures 
(y  and  z directions)  although  most  of  the  integrated  optical  devices  are 
3-D  in  nature.  The  equivalent  index  profiles  can  be  obtained  by  slicing 
the  3-D  index  profile  along  the  z-direction,  assuming  that  each  slice  is 
z-invariant  and  converting  the  2-D  (x  and  y directions)  index  profile  of 
each  slice  into  a 1-D  (y  direction  only)  effective  index  profile  by  the 
effective  index  method.  The  implementation  of  BPM  is  quite  easy. 
However,  due  to  the  limitations  of  the  method,  care  must  be  taken  in 
the  calculation  in  some  cases,  which  will  be  mentioned  in  this  section. 

3-4.1.  BPM  Formulation 

Beam  propagation  method  is  one  of  the  wave-optics  approaches 
for  the  analysis  of  the  propagation  of  the  beam  in  a graded -index 
medium.  3-D  as  well  as  2-D  simulation  can  be  be  performed  by  BPM. 
Recently  many  simulations  have  been  done  by  BPM  because  of  its 
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power.  Several  modified  BPM’s  [Nyma89,  Yevi89a,  Yevi89b,  Yevi89c, 
Yevi90a,  Yevi90b]  were  proposed  over  the  last  two  years.  However, 
since  less  assumptions  have  been  made  for  the  standard  BPM,  only  the 
standard  one  illustrated  by  Okoshi  [Okos85]  will  be  discussed  here. 

Considering  the  vector  Helmholtz  equation  including  the 
longitudinal  components  Ez  and  Hz,  Eqs.  (3.23)  and  (3.24)  can  be 
rewritten  as 

V2E  + v|E-VlneJ  + co2ep0  E = 6 (3.65) 

V2H  + (Vine)  x |Vxh)  + co28|i0  H = 0 , (3.66) 


Again  with  the  assumption  of  a slowly  varying  index  profile.  Vine  = 0, 
the  index  gradient  terms  of  Eqs.  (3.65)  and  (3.66)  can  be  neglected. 
One  then  obtains 


V2E  + k§n2  E = 0 (3.67) 

V2H  + k§n2  H = 6 , (3.68) 


where  n is  a function  of  x,  y,  and  z;  k0  is  the  wave  number  in  vacuum. 
Let  F represent  the  E or  H fields  for  the  quasi-TE  or  quasi-TM  modes, 
respectively.  The  wave  equation  becomes 


a2F 

9z2 


= - (V2+k§n2)F  = - AF 


(3.69) 


where  V?  is  the  transverse  Laplacian  and  sn  is  an  operator.  The 

above  equation  has  the  solution  at  z=Az  in  terms  of  the  field  at  z=0  as 
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F(x,y,Az)  = exp(±j*Az)  F(x,y,0) 

= exp(-j*A^  F(x,y,0),  (3.70) 


where  the  plus  sign  for  the  reflected  wave  is  neglected.  The  operator 
*1/2  can  be  decomposed  into  three  terms: 


A1'2  = (Vj+kgn2)1'2 


V2 


1 


+ koTL 


(Vj+k§n2)1/2+k0n 

V?  / \ 

— — 7 Yb + k°n°  + k°n°h?-- 1 • 

(Vi+k§n2) 1 2+k0n  ln°  1 


(3.71) 


where  nQ  is  a reference  index,  independent  of  position.  n0  is  typically 
set  equal  to  the  substrate  index  or  the  approximate  mode  index  in  the 
waveguide.  It  is  also  assumed  that  the  position-dependent  index  n in 
the  first  term  of  RHS  of  Eq.  (3.71)  is  equal  to  n0.  Therefore,  the 
operator  A1/2  is  expressed  as  the  summation  of  the  propagation  in  the 
homogeneous  medium  with  index  n0  and  a position-dependent  phase 
correction  term  %(x,y,z): 


A112  = 


V2 


i 


9 _ 1 10  + k0n0  + k0n0|-D^  - 1) , (3.72) 

(V2+k§n§)  2+k0n0  'n° 


where  %(x,y,z)=k0n0(rL  -1)  and 

no 


n-no 


no 


«1.  The  phase  of  F can  be 


approximated  to  be  exp(-jk0n0z)  since  we  assume  n = nD.  Thus,  the 
field  F becomes 


F(x,y,z)  = F(x,y,z)  exp(-jk0n0z)  , 


(3.73) 
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where  the  amplitude  is  a slowly  varying  function  of  z and  the  beam  is 
assumed  to  be  quite  parallel  to  the  z-direction.  Substituting  Eq.  (3.73) 
and  (3.71)  into  Eq.  (3.69),  we  have 


F(x,y,z+Az)  = exp  -jAz 


= e>q  -j4p 


exP|  -j4f 


vi 


(Vi+k§n§)1/2+k0n0 


+ X 


/J 


F(x,y,z)  + 0(Az3) 


V2 


i 


(V2+k§ng)1/2+k0n0 


exP(-j%Az) 


V2 


i 


(V2+k§n§)1/2+k0n0 


F(x,y,z)+0(Az3)  . 


(3.74) 


The  first  factor  in  the  RHS  of  Eq.  (3.74),  exp 


_,Az 


vi 


l(Vi+king)1/2+k0n0/ 


is  equivalent  to  the  wave  propagation  in  a homogeneous  medium  with 
index  n0  for  a distance  Az/2,  as  shown  in  Fig.  3-12  for  z between  0 and 
Az/2.  The  factor  exp(-j%Az),  which  is  position-dependent,  represents  a 
phase  correction  term.  Once  the  initial  condition  F(x,y,z)  is  given,  the 
field  at  z+Az  can  be  evaluated  by  Eq.  (3.74).  Repeating  the  procedure, 
the  propagation  of  the  wave  can  be  traced  in  a step-by-step  manner. 
The  analogy  of  Eq.  (3.74)  is  equivalent  to  propagating  an  optical  beam 
through  a periodic  array  of  thin  lenses  as  shown  in  Fig.  3-12  [Feit78], 
where  the  lenses  serve  the  purpose  of  modifying  the  phase  of  the 
beam.  The  complicated  propagation  problem  is,  then,  converted  into  a 
simple  calculation  procedure.  The  smaller  step  size  Az,  the  smaller 
the  truncation  error. 
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Fig.  3-12.  The  equivalent  analogy  of  wave  propagating  in  a medium 
by  BPM  expressed  in  Eq.  (3.74). 
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3.4.2.  Numerical  Implementation 


Equation  (3.74)  is  the  basic  BPM  equation  for  general  cases.  It 
can  be  reduced  to  a 2-D  expression  by  writing  F(x,y,z)  as  F(y,z),  where 
z is  the  propagation  direction  and  y is  the  transverse  direction  parallel 
to  the  waveguide  surface.  The  effective  index  method  is  used  to 
convert  the  3-D  (x,y,z)  index  profile  into  a 2-D  (y,z)  problem  by  slicing 
the  devices  vertically  along  x direction  and  calculating  the  effective 
index  profile  as  a function  of  y for  each  slice  as  mentioned  earlier.  The 
difference  is  that  the  mode  index  of  the  effective  index  profile  is  not 
evaluated.  The  calculation  involves  solving  for  the  propagation  of  the 
wave  in  a homogeneous  medium  n0  which  is  described  by  Okoshi 
[Okos85].  It  will  be  briefly  introduced  here. 

The  first  step  is  to  evaluate  the  Fourier  transform  Fn(z)  of  the 
field  F(y,z): 


%!  2 


F(y,z)=  £ 

n=-Xj2 


Fn(z)  exp(jkyny) , 


(3.75) 


where  X is  the  total  number  of  points;  kyn=  -j-n;  and  L is  the  width  of 

the  computational  area  in  the  y direction.  The  field  at  z+Az/2,  before 
the  lens,  is  expressed  as 


Fn(z+4p 


= exp|  -jAz. 


V2 


1 


(Vi+k§n§)1/2+k0n0/ 


Fn(z)  , (3.76) 


2 

where  V±  can  be  replaced  by  -kfn  in  the  calculation.  To  evaluate  the 
field  just  after  the  lens,  we  need  the  field  in  the  spatial  domain  so  that 
the  phase  can  be  calculated  by  the  phase  correction  term.  Therefore, 
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in  the  second  step  the  inverse  Fourier  transform  is  employed  to  yield 
F(y,z+Az/2)  before  the  lens.  Next  this  field  is  multiplied  by  exp(-j%Az) 
to  obtain  the  phase  corrected  field  and,  then,  propagating  for  Az/2  to 
complete  the  calculation  for  the  section  Az.  Since  the  field  is  in  ky- 
domain  at  z+Az,  the  calculation  of  Fourier  transform  is  not  necessary 
for  the  next  section  before  the  lens.  Hence,  one  Fourier  transform  and 
one  inverse  Fourier  transform  are  involved  in  the  propagation  through 
one  section.  The  CPU  time  may  be  saved  by  using  the  fast  Fourier 
transform  (FFT).  In  the  calculation  the  mode  index  and  the  mode  field 
are  not  needed.  The  other  advantage  is  that  the  radiation  modes  are 
not  neglected.  However,  there  are  some  limitations.  First,  the  index 
must  be  close  to  the  reference  index  n0  and  the  step  size  Az  must  be 
small  enough.  Because  of  the  nature  of  FFT,  the  boundary  condition  is 
periodic  over  the  calculation  width  L.  Therefore,  absorbers  must  be 
placed  at  the  calculation  boundary  so  that  the  field  is  not  transmitted 
beyond  the  boundaries  or  reflected  to  the  waveguide  region.  The 
number  of  mesh  points  must  be  the  power  of  2 and  the  mesh  points 

are  equally  distributed.  BPM  is  also  applicable  to  the  simulation  of  3-D 
devices.  In  this  case,  the  2-D  FFT  is  used  and  V?  = -k£n-k£n. 

3-4.3.  Simulation  of  Cross  Coupler  bv  BPM 

The  simulation  of  a cross  coupler  is  given  here  as  an  example  to 
illustrate  the  power  of  BPM.  Cross  couplers  have  wide  applications  in 
optical  signal  processing,  e.g.,  3dB  power  divider  [Huss89,  Huss90]. 
The  schematic  of  an  asymmetric  cross  coupler  is  shown  in  Fig.  3-13. 
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Fig.  3-13.  Schematic  diagram  of  a cross-coupler.  The  input 
branches  A and  B are  asymmetrical,  while  the 
output  branches  C and  D are  symmetrical. 
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The  waveguide  width  WB  is  wider  than  WA  at  input  and  WC=WD  at 
output.  As  long  as  no  mode  conversion  occurs,  namely,  the  adiabatic 
condition  is  satisfied  [Huss89,  Huss90],  the  power  inputs  from  branch 
A and  B will  keep  in  the  first  higher  mode  and  the  fundamental  mode, 
respectively,  through  the  device.  It  is  obvious  that  the  fundamental 
normal  mode  and  first  higher  mode  have  equal  powers  at  output 
branches  since  the  structure  is  symmetric  at  the  output.  Thus,  3-dB 
power  splitting  is  possible  if  the  wave  is  input  either  from  port  A or 
port  B.  The  simulation  is  shown  in  Fig.  3-14  for  WA=6.0  pm,  WB=8.0 
pm  and  WC=WD=7.0  pm.  The  angle  0 is  1.25  mrad.  The  figure  shows 
the  magnitude  of  the  field  as  functions  of  y and  z.  The  power  input 
from  branch  B is  equally  divided  at  the  output  ports  C and  D.  BPM  can 
also  be  used  to  predict  the  radiation  loss  of  the  devices.  This  will  be 
discussed  in  chapter  5 for  the  case  of  the  tapered  waveguides. 

3.5.  Summary  of  Numerical  Methods 

Systematic  modeling  for  integrated  optics  is  possible  if  we  can 
simulate  the  index  profiles  and  the  fields  in  waveguide  devices.  In 
most  cases,  analytical  solutions  are  not  available.  Therefore,  numerical 
methods  are  of  paramount  importance.  In  this  chapter,  the  Green’s 
function  method  for  the  linear  diffusion  equation  and  the  finite 
difference  method  for  solving  the  nonlinear  diffusion  equation  in  the 
case  of  1-  and  2-D  waveguides  were  introduced.  The  methods  for 
solving  the  1-D  Helmholtz  equation  that  were  discussed  in  this 
chapter  include  the  WKB  method,  the  matrix  method,  and  the  finite 
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Fig.  3-14.  Simulation  of  cross-coupler  by  BPM.  The  wave  is  launched 

from  the  left  hand  side  and  outputs  from  the  right  hand  side. 
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difference  method.  The  effective  index  method  and  the  extended  2-D 
finite  difference  method  used  to  solve  the  2-D  Helmholtz  equation 
were  also  presented.  Finally,  the  BPM  was  presented  and  its  use  was 
demonstrated  by  propagating  a wave  through  an  asymmetric  2X2  cross 
coupler. 


CHAPTER  4 

FABRICATION  AND  CHARACTERIZATION 
4.1.  Fabrication 

The  processing  steps  involved  in  the  fabrication  of  ion- 
exchanged  glass  waveguides  are  quite  simple  and  cost-effective.  It  can 
be  done  at  relatively  low  temperatures  (200~400°C)  compared  to  the 
fabrication  of  semiconductor  waveguides.  In  general,  the  fabrication 
process  is  quite  similar  to  the  standard  process  in  the  well-developed 
silicon  technology  in  the  sample  preparation  and  photolithography 
stages;  this  reduces  the  cost  in  the  setup  of  the  facility  if  the 
semiconductor  processing  equipments  are  already  available.  Clean 
areas  and  hoods  are  generally  required  to  avoid  contamination  from 
the  environment  during  the  diffusion.  In  the  diffusion,  an  external 
electric  field  can  be  applied  to  speed  up  the  diffusion  and,  thus, 

reduce  the  processing  time  or  temperature,  as  described  in  the 
previous  chapters. 

Several  commercial  glass  substrates  are  available  for  fabricating 
ion-exchanged  waveguides.  Among  them,  BK7  glass  has  high  optical 
homogenity,  low  loss  at  the  wavelengths  of  interest  and  the  ability  to 
withstand  the  attack  of  molten  salt,  specifically  the  nitrate  melt.  The 
sodium  and  potassium  ions  in  BK7  glass  (see  Table  1 for  glass 
composition)  are  suitable  for  exchanging  since  they  are  glass  network 
modifiers  and  have  rather  high  diffusion  coefficients  compared  to 
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other  ions  in  this  glass.  The  transition  temperature  of  the  glass  is  also 
higher  than  the  temperature  required  for  silver  or  potassium  ion 
exchange.  Therefore,  we  have  chosen  this  glass  for  the  ion  exchange 
process. 

As  described  in  chapter  2,  silver-ion  exchanged  waveguides  can 
be  fabricated  by  applying  a field  across  the  sample  with  the  silver  thin 
film  as  an  anode  [Vilj80].  The  metallic  silver  is  oxidized  and  driven  by 
the  external  field  into  the  substrate  to  form  waveguides.  However,  in 
this  project  we  emphasize  the  exchange  performed  in  the  molten  salt 
bath  since  the  diffusion  setup  is  simpler  and  easier  and  yields  low-loss 
waveguides.  The  process  of  fabricating  high-performance,  low-loss 
optical  waveguides,  briefly  described  in  [Sriv87b,  Zhen89,  Chen90b], 
is  detailed  in  this  chapter.  The  fabrication  of  1-D  waveguides  will  be 
presented  first  since  the  fabrication  is  easier  than  2-D  waveguides. 
The  process  will  be  described  in  section  4.1.1  for  1-D  waveguides  and 
4.1.2  for  2-D  waveguides. 

4.1.1.  One-Dimensional  Waveguides 

One-dimensional  waveguides  are  fabricated  on  a planar  substrate 
without  any  masks  on  the  glass  and  are  useful  in  the  characterization 
of  the  diffusion  parameters  such  as  the  self-diffusion  coefficient,  the 
activiation  energy,  etc.,  and  waveguide  parameter  e.g.,  the  maximum 
index  change.  The  process  includes  melt  preparation,  sample 
preparation,  diffusion,  and,  finally,  waveguide  cleaning  and  polishing. 
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Melt  Preparation 

An  aluminum  beaker  is  used  in  the  diffusion  because  the 
oxidized  surface  layer  on  the  beaker  is  inert  to  the  nitrate  melt  and 
both  the  A1  and  its  ions  cannot  easily  diffuse  into  the  sample.  Before 
preparing  the  melt,  the  container  is  thoroughly  cleaned  by  using  a 
commercial  aluminum  etchant  at  low  temperature  for  two  or  three 
hours.  A1  etchant  is  rinsed  away  in  de-ionized  (DI)  water.  Next  the 
beaker  is  dried  by  a nitrogen  gun  and  baked  on  the  hot  plate  for  more 
than  an  hour  at  ~300°C  so  that  its  surface  is  oxidized. 

The  preparation  of  the  melt  depends  on  the  ions  used  in  the 
exchange.  For  Ag+-Na+  exchange  in  BK7,  the  salt  bath  consists  of 
0.333-MF  KNO3,  0.666-MF  NaN03  and  10'3-MF  AgN03  which  gives  an 
index  change  of  0.012.  Higher  index  change  can  be  obtained  by 
increasing  the  concentration  of  silver  ions.  The  purpose  of  KN03  in 
the  salt  mixture  is  to  prevent  the  K+  out-diffusion  [Zhen89];  out- 
diffusion  of  K+  would  cause  cracks  in  the  sample  [Bran86,  Zhen89].  A 
typical  bath  contains  300  grams  of  NaN03,  178.44  grams  KN03  and 
900  mg  AgN03.  The  silver  nitrate  is  weighed  in  a small  teflon  beaker 
to  an  accuracy  of  milligrams  and  added  to  the  molten  salt  at  330°C. 
The  salt  bath  must  remain  clear  during  the  addition  of  AgN03.  If  the 
melt  is  contaminated,  it  is  likely  that  a dark  or  high-reflection  thin 
film  floating  on  the  melt  surface  would  be  observed.  Most  frequently 
the  contamination  comes  from  the  salts.  Hence,  ultrapure  or  ACS 
grade  chemicals  are  recommended.  Sometimes  a careless  cleaning  of 
a thermocoupler  or  stirrer  may  cause  the  melt  contamination. 


101 


Sample  Preparation 

First,  a 2”X2”X1  mm  substrate  glass  is  fixed  on  the  ceramic  by 
wax  and  cut  by  a wafer  saw  into  appropriate  sizes,  which  depend  on 
the  application.  For  cross  couplers,  the  length  of  samples  can  be  as 
long  as  2 inches.  Usually,  the  size  of  samples  is  1X1  inch  square  for 
straight  channel  waveguides.  Since  glass  is  isotropic,  the  orientation  of 
the  samples  is  not  important  in  cutting.  After  cutting,  each  sample  is 
marked  by  the  diamond  pen  and  cleaned. 

Sample  cleaning  is  done  in  a clean  room  under  a hood  because 
some  chemicals  are  inflammable  and  hazardous  to  health.  In  the 
cleaning  process,  a dust-free  N2  gun  is  used  to  dry  the  samples.  The 
wax  on  the  samples  is  removed  by  alternatively  and  repeatedly 
immersing  the  samples  into  warm  and  fresh  acetone  and  1,1,1- 
Trichloroethane  (TCA)  in  separate  glass  beakers.  Each  beaker  must  be 
labeled  and  used  for  only  one  chemical.  After  the  wax  is  removed,  a 
standard  cleaning  procedure  is  followed;  the  samples  are  tenderly 
washed  with  soap  and  DI  water  and  dried  by  the  N2  gun.  Care  must  be 
exercised  due  to  the  sharp  edges  of  the  glass  after  cutting.  Next,  the 
samples  are  subjected  to  sequential  and  quick  immersions  in  warm 
and  fresh  acetone,  TCA,  acetone  and  methanol  for  two  minutes  each 
and  prebaked. 

Prebaking  and  Diffusion 

The  prepared  substrates  are  transferred  from  the  clean  room  to 
the  processing  laboratory  in  air-tight  sample  boxes  and  prebaked  in  an 
oven  for  several  hours  at  200°C  to  remove  any  moisture  from  the 
surface.  The  molten  salt  is  ready  for  diffusion  when  it  reaches  the 
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desired  diffusion  temperature.  The  diffusion  setup  is  shown  in  Fig.  4-1. 
A sample  is  placed  on  the  bottom  of  the  aluminum  container  with  a 
thermocoupler  next  to  the  sample  to  accurately  sense  the  substrate 
temperature.  A 0.5-mm  spacer  is  inserted  between  the  beaker  and  hot 
plate  to  keep  the  beaker  from  direct  contact  with  the  hot  plate  since 
the  temperature  variation  on  the  hot  plate,  controlled  by  an  ON/OFF 
relay  in  the  temperature  controller,  may  be  large  and  nonuniform.  A 
teflon  tape  wrapped  shield  and  a cover  are  used  to  insulate  the  beaker 
from  the  ambient.  This  allows  desirable  temperature  uniformity  in  the 
melt,  to  within  a few  degrees  C in  the  vertical  direction  and  less  than 
1 C in  the  horizontal  direction.  No  stirrer  is,  therefore,  required 
which  simplifies  the  setup.  The  thermocoupler  is  calibrated  by  a 
mercury  thermometer  prior  to  the  diffusion  since  difference  in  the 
readings  by  the  common  thermocoupler  and  mercury  thermometers 
may  be  as  large  as  10°C.  The  temperature  control  and  a precise 
temperature  setting  are  very  crucial  since  the  waveguide 
characteristics  strongly  depend  on  the  diffusion  temperature. 

Before  the  diffusion,  the  samples  are  heated  by  holding  them 
just  above  the  melt  for  a couple  of  minutes.  A timer  is  used  to  monitor 
the  diffusion  time.  Once  the  samples  are  immersed  and  covered  by  the 
molten  salt,  the  timer  is  started.  During  the  diffusion  process,  the 
substrates  must  be  free  of  air  bubbles  on  the  surface  otherwise  they 
will  cause  inhomogeneous  diffusion.  The  bubbles  can  be  wiped  out  by 
the  surface  tension  of  the  salt  bath.  They  are  removed  by  repeatedly 
taking  the  substrates  out  of  the  melt  and  quickly  lowering  them  into 
the  melt.  The  single-mode  waveguides  at  1.3  pm  wavelength  can  be 
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Fig.  4-1.  The  diffusion  setup  used  in  the  ion-exchange  process. 
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fabricated  at  340°C  by  diffusing  for  about  8-12  hours  in  BK7  glass. 
Multimode  waveguides  are  attainable  by  increasing  the  temperature  or 
the  diffusion  time.  After  the  diffusion  is  completed,  the  withdrawal  of 
the  samples  from  the  melt  should  be  slow  so  that  no  microcracking  is 
induced  in  the  glass.  The  substrates  are  pulled  out  of  the  melt  and  set 
aside  to  cool  down  to  room  temperature. 

Waveguide  Cleaning  and  Polishing 

The  salt  on  the  glass  is  removed  by  rinsing  the  glass  in  DI  water. 
The  waveguides  are  cleaned  again  using  the  same  standard  procedure 
described  earlier  in  this  section. 

The  samples  must  be  thoroughly  cleaned  before  polishing  in 
order  to  avoid  scratches  from  dust  particles.  First,  using  Bee-wax,  a 
small  and  cleaned  glass  pad  is  attached  at  edge  of  the  waveguide 
surface  to  be  polished,  as  shown  in  Fig.  4-2(a).  The  small  pad  serves  as 
a protection  from  edge  chipping  during  the  polishing  since  the 
waveguides  are  usually  only  a few  microns  deep.  The  width  of  the  pad 
is  about  3 mm  and  the  length  is  the  same  as  the  width  of  the  subtrate. 
The  pad-waveguide  assembly  is  placed  on  the  holder  such  that  the 
distance  between  the  substrate  edge  to  be  polished  and  the  holder 
edge  is  about  2 mm.  In  the  next  step  the  sample  and  holder  are  tightly 
clamped  together  at  wax  melting  point  so  that  the  gap  between  the 
waveguide  and  the  pad  is  minimized  (less  than  5 pm)  as  shown  in  Fig. 
4-2(b).  Next  a brass  block  is  used  to  hold  the  clamp  and  the  sample,  as 
shown  in  Fig.  4-2(c).  The  substrate  must  exceed  the  surface  of  the 
block  by  approximately  1 mm  so  that  this  part  can  be  easily  polished 
away.  To  avoid  damaging  the  block  in  the  polishing,  some  glass  slides 
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(a) 


(b) 


Screw 
Clamp 
Bee  wax 
Substrate 
Holder 


Fig.  4-2.  The  sample  holder  for  polishing  glass  waveguides,  (a)  Substrate 
and  glass  pad  waxed  on  a holder,  (b)  Side  view  of  the  clamped 
sample  and  holder. 
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Glass  plates 


Brass  block 


(C) 


Fig.  4-2  - continued,  (c)  Brass  block  with  sample  holder.  The  shaded 
areas  are  the  glass  slides  waxed  on  the  surface  of  the  brass 
block. 
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are  waxed  on  the  block  surface  as  shown  by  the  shaded  areas  in  Fig.  4- 
2(c).  The  slides  can  also  slow  down  the  polishing  rate. 

The  polishing  procedures  are  described  as  follows.  The  sample 
is  first  ground  by  hand  with  grit  400  SiC  powder  until  the  surface  of 
the  polished  sample  edge  is  on  the  same  level  as  the  block  surface. 
The  surface  of  the  sample  edge  is  not  smooth  due  to  the  roughness  of 
the  powder.  After  cleaning  the  block  and  the  sample  with  water,  the 
sample  is  polished  by  5 pm  alumina  powder  for  about  two  minutes. 
Note  that  the  position  of  the  clamp  on  the  brass  block  must  be  kept 
fixed  to  assure  good  polishing  of  the  waveguide.  The  block  is  again 
rinsed  in  water  and  dried  by  a N2  gun.  In  the  next  polishing  step,  1 
pm  diamond  paste  on  a nylon  cloth,  held  by  an  adhesive  back  on  a 
smooth  disc,  is  used.  In  this  step,  the  block  rotates  as  the  disk  spins. 
Since  the  rotations  of  the  block  and  the  disc  does  not  occur  around 
the  same  axis,  uniform  polishing  can  be  achieved.  The  polishing  may 
last  for  30-60  minutes  depending  on  the  scratch  sizes  before  the  fine 
polishing.  After  polishing  and  cleaning,  the  sample  is  checked  under  a 
microscope  with  oblique  incidence  light  so  that  the  scratches  on  the 
surface  can  be  easily  seen.  A finer  polish  with  a quarter  micron 
diamond  paste  may  be  required  for  small-size  (single-mode) 
waveguides.  The  edge  of  the  sample  at  the  pad-waveguide  interface 
should  be  smooth  and  flat  like  a mirror  after  this  step  of  processing. 
After  polishing,  the  pad  is  removed  by  melting  the  wax.  The  bee  wax 
on  the  waveguide  is  removed  by  warm  TCA.  If  any  residue  is  left  on  the 
waveguide,  it  is  removed  by  soap  and  DI  water.  The  samples  are  stored 
in  a round  sample  box  with  the  waveguide  side  up  so  that  the  polished 
edges  may  not  be  damaged  by  the  box  wall.  If  the  glass  substrates  are 
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damaged  or  even  broken,  then  all  the  efforts  are  done  in  vain. 
Polishing  requires  great  care  even  though  it  is  not  the  most  critical 
step  in  dictating  the  performance  of  the  devices. 

4.1.2.  Two-Dimensional  Waveguides 

The  fabrication  process  of  the  2-D  or  channel  waveguides  is 
similar  to  the  1-D  waveguides  except  that  some  more  steps  are 
needed  to  define  the  channels.  The  processes  include  melt 
preparation,  sample  preparation,  mask  evaporation,  photolithography 
etching,  prebaking,  diffusion,  polishing,  and  removal  of  mask,  where 
the  underlines  highlight  the  additional  steps  for  the  2-D  waveguides. 
They  are  described  in  this  subsection. 

Choice  and  Evaporation  of  the  Mask  Material 

The  purpose  of  the  mask  is  to  prevent  the  diffusion  of  the  silver 
ions  into  the  undesired  regions  in  the  glass.  The  choice  of  the  mask 
material  is  important  because  the  mask  must  be  able  to  1)  bond  to  the 
substrate  tightly,  2)  tolerate  the  attack  of  the  nitrate  melt  at  the 
diffusion  temperature  for  the  desired  duration,  3)  not  react  with  the 
silver  ions,  4)  not  contaminate  the  melt,  5)  not  diffuse  into  the 
substrate,  6)  not  affect  (either  retard  or  accelerate)  the  silver-sodium 
exchange  in  the  mask  opening  region,  7)  prevent  the  penetration  of 
the  silver  ions  in  the  masked  region,  8)  dissolve  easily  in  a solvent 
after  diffusion,  9)  open  windows  with  sharp  edge  by  the  etching  or 
liftoff  processes,  and  10)  be  seen  under  the  microscope  in  the  mask 
aligner  for  alignment.  Many  materials  have  been  tried  in  this 
laboratory,  e.g.,  Al,  Ti,  Cr,  Ag,  A1203,  Si02,  etc.  It  is  found  that  the  best 
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choice  is  a mask  consisting  of  an  A1  film  on  a thin  AI2O3  layer  since 
this  structure  eliminates  the  shortcomings  of  both  A1  and  AI2O3  films 
used  alone.  It  was  found  that  the  aluminum  film,  used  for  years  in  our 
laboratory,  may  induce  a local  field  at  the  edges  of  the  mask  opening 
just  under  the  mask  and  reduce  the  silver  ions  into  metallic  form  in 
that  region,  thus,  increasing  the  loss  of  the  waveguides  [Walk83b].  The 
lowest  loss  silver  ion-exchanged  waveguides  are  usually  found  to  be  the 
ones  with  the  least  reduction  of  silver  ions.  The  reduction  of  the  silver 
ions  into  metallic  silver  can  be  identified  by  a yellowish  color  at  edges 
of  the  waveguides.  The  A1203  film  itself  is  not  useful  since  it  is 
transparent  and  not  effective  for  blocking  the  diffusion  of  the  silver 
ions.  However,  a composite  structure  consisting  of  AI2O3  and  A1  films 
eliminates  the  problems  associated  with  the  single-material  film. 

The  mask  is  fabricated  by  E-beam  evaporation  at  a vacuum  of 
about  10'5  torr.  Before  this  step,  the  samples  must  be  completely 
cleaned  and  moisture-  and  dust-free.  First,  a 400  A AI2O3  film  is 
evaporated  very  slowly  (~2A/sec).  The  thickness  is  measured  by  quartz 
crystal  thickness  monitor.  Next,  a 1500  A A1  film  is  evaporated  in 
about  15  seconds,  which  results  in  the  structure  shown  in  Fig.  4-3(a). 
The  film  should  look  shiny  and  mirror-like  indicating  good  quality.  If 
the  carbon  crucible  for  the  A1  evaporation  is  contaminated  by 
aluminum  carbide,  the  film  may  become  yellowish  and  can  not  be 
easily  etched  away  by  A1  etchant.  Therefore,  the  crucibles  must  be 
changed  frequently. 
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(a)  Mask  Evaporation 
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(c)  UV  Exposure 
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(f)  Photoresist  Removing  (g)  Diffusion 


Fig.  4-3.  Procedures  to  pattern  the  mask  and  fabricate  2-D  waveguides. 
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Photolithography 

After  the  evaporation,  the  samples  are  moved  to  the 
photolithography  room  as  soon  as  possible  for  the  photolithography 
processing.  A N2  gun  with  0.45  pm  filter  is  used  to  blow  the  samples. 
The  sample  is  then  loaded  on  the  photoresist  spinner  which  is  set  at 
4000  rpm  for  35  seconds  spinning  time.  The  Shipley  1400-17 
positive  photoresist  is  dropped  on  the  sample  by  a syringe  with  a 0.45 
pm  filter  until  the  sample  is  covered  by  the  photoresist.  After 
spinning,  the  thickness  of  the  photoresist  film  is  approximately  0.5 
pm.  The  samples  are  then  kept  in  an  oven  for  soft-baking  at  90°C  for 
30  minutes  under  N2  atmosphere.  The  structure  of  samples  at  this 
stage  is  shown  in  Fig.  4-3(b).  The  light  reflected  from  the  samples 
should  be  uniform  and  have  the  same  color,  indicating  the  uniformity 
of  the  photoresist  film. 

The  exposure  of  the  photoresist  is  done  by  UV  light  (-365  nm 
wavelength)  in  the  mask  aligner,  as  shown  in  Fig.  4-3 (c).  The 
photomask,  designed  by  a workstation  VIA- 100,  has  several  band  of 
the  same  patterns  separated  by  a 100  pm  opening.  Each  band  has 
twenty  straight  channels  whose  openings  are  from  0.5  pm  to  10  pm  in 
a step  of  0.5  pm.  The  separation  of  channels  is  75  pm  which  is  much 
larger  than  the  dimensions  of  the  mode  fields.  The  material  of 
photomask  is  usually  anti-reflection  coated  chrome.  Depending  on  the 
intensity  of  the  light,  the  exposure  time  is  set  to  2.3  seconds.  If  the 
diagonally  pin-point  connected,  one-micron  grids,  as  shown  in  Fig.  4- 
4(a)  are  seriously  connected  after  development  without  photoresist 
(Fig.  4-4(b)),  then  it  is  over-exposed.  If  they  are  seriously  separated  by 
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Fig.  4-4.  Patterns  used  to  check  the  exposure  time,  (a)  Pattern  on  the  photomask. 

The  squares  are  1 by  1 pm  square,  pin-point  connected,  (b)  The  squares 
are  seriously  connected  after  development,  (b)  The  squares  are  separated 
by  the  photoresist. 
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the  photoresist  (Fig.  4-4(c)),  then  it  is  under-exposed.  The  exposure 
time  must  be  determined  to  the  precision  of  0. 1 second  or  within  5% 
error.  The  resolution  of  the  photoresist  is  approximately  one  micron. 
The  photoresist  is  developed  for  50  seconds  just  after  the  exposure 
using  Shipley  351  developer  diluted  in  five-part  DI  water  by  volume. 
The  development  must  be  done  at  the  same  temperature  (around 
22°C)  with  gently  stirring  to  assure  reproducible  results.  Next,  the 
sample  is  immediately  rinsed  in  smoothly  flowing  DI  water  for  one 
minute  to  stop  the  action  of  the  developer  and  dried  by  the  filtered  N2 
gun.  Figure  4-3(d)  shows  the  sideview  of  the  sample.  The  patterns  are 
checked  and  measured  under  the  microscope.  If  the  UV  exposure 
time  is  not  correct,  the  pattern  opening  on  the  sample  will  be  either 
larger  or  smaller  than  the  opening  on  the  photomask.  If  the  edge  of 
the  pattern  is  rough,  it  may  be  caused  by  either  the  unclean 
photoresist  or  the  dust  or  the  residue  photoresist  on  the  master  mask 
which  must  be  cleaned  before  every  use  by  the  procedure  described  in 
the  last  subsection.  The  mask  aligner  is  originally  designed  for 
semiconductor  wafers  of  a thickness  of  less  than  20  mils.  For  the 
substrate  thickness  of  40  mils  or  1 mm,  spacers  are  used  to  lift  the 
photomask  so  that  uniform  and  unstressed  contact  between  the  mask 
and  the  sample  can  be  attained.  Hard-baking  of  the  photoresist  after 
development  is  unnecessary  for  the  aluminum  etching  process  since 
the  opened  region  on  the  A1  film  may  be  oxidized  resulting  in  edge 
roughness  in  the  etching  process. 
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Etching 

The  etching  process  opens  windows  for  ion-exchange  in  the 
regions  defined  by  the  photoresist  on  the  A1  and  AI2O3  films.  A 
commercial  PAE  etchant  is  used  in  this  process;  it  consists  of  16  parts 
phosphoric  acid,  two  parts  water,  one  part  nitric  acid,  and  one  part 
acetic  acid.  The  samples  are  immersed  into  the  etchant  for  a couple  of 
minutes  and  then  stirred  mildly.  One  extra  minute  of  etching  time  is 
added  from  the  time  when  the  pattern  is  just  etched  through.  Since 
the  A1  film  may  not  be  the  same  in  each  case,  the  total  etching  time 
may  vary  from  sample  to  sample.  After  etching,  the  samples  are  rinsed 
m DI  water  at  once  to  stop  the  etching  process  and  wash  the  etchant 
away.  The  sideview  of  the  sample  after  etching  is  shown  in  Fig.  4-3(e). 
The  PAE  etchant  is  also  able  to  etch  the  alumina  sandwiched  between 
the  A1  and  the  glass.  However,  the  etching  rate  is  much  slower  than 
that  for  the  A1  film.  Since  the  thin  alumina  layer  hardly  affects  the 
silver-sodium  ion-exchange,  the  etching  of  A1203  film  is  not  critical. 
Although  the  alumina  layer  is  very  thin,  yet  the  film  helps  weaken  the 
local  field  induced  by  A1  film  as  mentioned  earlier  and  decreases  the 
silver  ion  reduction  under  the  mask  at  the  waveguide  edge.  As  a result, 
it  can  reduce  the  propagation  loss  of  the  waveguides.  Again  the  opened 
pattern  on  the  A1  mask  must  be  examined  under  the  microscope  and 
compared  to  the  pattern  on  the  photoresist.  They  should  be  exactly 
the  same.  Over-etched  samples  exhibit  larger  opening  in  the 
waveguide  region  with  rougher  edges  while  under-etched  samples 
show  darker  spots  or  lumps  in  that  region  under  a microscope.  The 
resolution  of  an  optical  microscope  is  around  one  visible  wavelength. 
However,  it  is  still  a convenient  tool  since  the  A1  mask  is  opaque  and 


115 


can  be  easily  inspected.  The  width  deviation  of  the  waveguide  and  the 
edge  smoothness  of  mask  opening  are  primarily  determined  by  the 
photolithography  and  the  etching.  Therefore,  these  two  steps  are 
crucial  in  the  fabrication  of  high  quality  reproducible  waveguides. 

After  the  etching  process,  the  photoresist  is  removed  by  acetone 
and  the  sample  cross-section  appears  as  shown  in  Fig.  4-3(f).  To 
assure  a complete  removal  of  the  photoresist,  a photoresist  remover 
with  neutral  pH  value  is  used  since  the  residue  photoresist  may  cause 
contamination  of  the  melt.  The  samples  are  cleaned  again  by  standard 
cleaning  procedure  before  the  prebaking  and  diffusion. 

Prebaking.  Diffusion  and  Polishing 

Basically  these  processes  for  2-D  waveguides  are  the  same  as  1-D 
waveguides.  Prebaking  oxidizes  the  surface  of  the  aluminum,  which 
helps  strengthen  the  mask  film  on  the  sample.  Longer  diffusion  time 
and/or  higher  temperature  are  used  for  fabricating  single-mode  2-D 
channel  waveguides  than  1-D  planar  waveguides.  After  diffusion,  the 
side  view  of  sample  appears  as  shown  in  Fig.  4-3(g),  where  the 
waveguide  region  is  shown  by  the  shaded  area.  In  the  polishing,  the 
edge  to  be  polished  must  be  exactly  normal  to  the  channels  except  for 
some  special  applications  where  an  inclined,  polished  edge  is  needed. 
Since  the  aluminum  mask  is  still  on  the  sample,  the  alignment  for 
polishing  can  be  easily  done  under  the  microscope. 

Removal  of  Mask 

The  aluminum  mask  is  removed  by  immersion  in  warm  PAE 
etchant  around  80  C for  half  an  hour  or  longer.  The  undesired  residue 
aluminum  in  the  waveguide  region  may  increase  the  waveguide  loss. 
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The  AI2O3  film  can  also  be  removed  by  the  PAE  etchant.  However, 
nitric  acid  is  used  for  this  purpose.  The  sample  is  immersed  in  dilute 
nitric  acid  for  about  30  minutes  and  then  rinsed  in  DI  water.  After 
removing  the  mask,  the  waveguides  must  be  cleaned  by  the  standard 
procedure  prior  to  the  characterization. 

4.2.  Characterization 

The  optical  waveguides  are  characterized  by  their  index  profile, 
mode  index,  field  profile,  losses,  and  spectral  response  for  the  quasi- 
TE  and  the  quasi-TM  modes.  While  all  the  techniques  are  presented  in 
this  chapter,  the  technique  to  measure  the  spectral  response  will  be 
discussed  in  chapter  6.  Since  the  substrate  index  as  well  as  the 
characteristics  of  the  waveguide  depend  on  the  wavelength,  it  is 
important  to  measure  these  characteristics  by  a monochromatical  light 
source,  or  preferrably  a laser  source.  Most  of  the  measurements  were 
done  using  a laser  diode  at  1.3  pm  wavelength  since  the  lowest 
dispersion1  occurs  at  this  wavelength  in  optical  fibers.  A He-Ne  laser 
at  0.633  pm  wavelength  is  generally  used  for  alignment. 

4.2.1.  Mode  Index  of  Planar  Waveguides 

The  mode  index  of  1-D  waveguides  is  useful  in  determination  of 
the  1-D  index  profile  as  well  as  the  diffusion  coefficient  and  in 
projecting  the  fabrication  conditions  of  single-mode  channel 
waveguides.  In  this  subsection  the  prism  coupler  [Tien69,  Tien70, 
Ulri70]  and  inverse  WKB  method  [Whit76,  Chia85]  are  discussed,  and 

Since  the  line  width  of  light  can  never  be  zero,  dispersion  of  a pulse  is  inevitable. 
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the  results  are  compared  to  FDM  to  determine  the  self-diffusion 
coefficient  of  the  silver  ions.  This  methodology  can  be  applied  to  the 
other  ion-exchange  systems  to  obtain  the  self-diffusion  coefficient. 
This  method  to  determine  the  self-diffusion  coefficient  is  easier  than 
the  conventional  technique  using  the  isotropic  tracer  diffusion. 

Prism  Coupler 

The  prism  coupler  technique  utilizes  a high-index  prism  to 
launch  the  light  into  a waveguide.  It  has  the  ability  to  selectively  excite 
only  one  of  the  guided  modes  in  a multimode  waveguide.  Compared  to 
the  the  grating  coupling,  the  prism-coupler  method  is  simpler  and 
easier  since  no  additional  sample  preparation  is  needed.  Of  course,  the 
coupling  into  a specific  mode  depends  on  the  coupling  condition.  The 
principle  of  operation  is  illustrated  in  Fig.  4-5  and  briefly  explained 
below. 

Suppose  that  the  incident  angle  0 at  the  base  of  the  prism  is 
larger  than  the  critical  angle  0c=sin'1(na/np),  where  na  and  np  are  the 
index  of  the  air  and  the  prism,  respectively.  A standing  wave  in  the 
direction  normal  to  the  prism  base  is  formed  due  to  the  superposition 
of  the  incident  wave  and  the  totally  reflected  wave  in  the  prism.  The 
energy  flow  of  the  standing  wave  is  in  the  direction  parallel  to  the 
waveguide  surface.  The  propagation  constant  of  the  input  beam  in  the 
horizontal  direction  is 


Pp  = k0  np  sin  0 , 


(4.1) 
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Fig.  4-5.  Schematic  diagram  of  a prism  coupler.  The  light  (shaded  region)  is  launched  into  the 
waveguide  through  the  prism.  The  waveguide  is  tightly  clamped  against  the  prism. 
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where  k0  is  the  wave  number  in  vacuum.  If  pp  is  the  same  as  the 
propagation  constant  of  a guided  mode,  then  the  wave  in  the  prism 
may  couple  into  that  mode  in  the  waveguide  according  to  coupled  - 
mode  theory.  The  above  coupling  condition  can  be  expressed  as 

(3p  = k0  np  sin  0 = k0N  , (4.2) 

where  N is  the  mode  index  and  can  be  evaluated  by  measuring  the 
coupling  angle  0. 

The  gap  which  controls  the  coupling  efficiency  is  itself 
controlled  by  the  pressure  on  the  pin-point  support  below  the 
waveguide  and  must  be  adjusted  to  give  optimal  coupling.  If  the  gap  is 
too  wide,  then  no  coupling  occurs  to  the  waveguide  modes.  For  single- 
mode glass  waveguides,  the  gap  is  of  the  order  of  half  a wavelength. 
The  adjustment  by  varying  the  pressure  on  the  support  is  somewhat 
critical.  To  achieve  this  thickness,  both  the  prism  and  the  waveguide 
surface  must  be  clean  and  flat  to  avoid  damaging  the  waveguide  or  the 
prism.  The  control  of  position  of  the  coupling  spot  is  also  important  to 
launch  the  maximum  power  into  the  waveguide.  The  position  must  be 
chosen  so  that  the  energy  of  the  guided  light  in  the  waveguide  does 
not  transfer  back  to  the  prism.  Note  that  from  Eq.  (4.2),  np  must  be 
larger  than  the  largest,  fundamental  mode  index  N0.  Since  N0  are 
different  for  different  waveguides,  the  best  way  is  to  choose  np  > ns, 
the  surface  index  of  the  waveguide.  We  used  a prism  made  by  SF18 
glass  with  refractive  indices  of  1.71653  and  1.69171  at  0.6328  and  1.3 
pm  wavelength,  respectively.  The  glass  has  relatively  high  hardness 
and  high  transmission  at  both  the  visible  and  the  near-infrared 
wavelength.  The  prism  coupler  technique  is  not  applicable  in  the  case 
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of  the  buried  waveguides  since  the  coupling  may  be  very  weak  for  the 
embedded  waveguides. 

Experiment  and  Measurement 

The  experimental  setup  is  shown  in  Figs.  4-6(a)  and  (b),  where 
figure  (a)  shows  the  light  source  part  and  (b)  shows  the  prism  coupler 
and  detection  part.  Two  linearly  polarized  lasers  are  used  in  the  setup; 
a 1.3  pm  wavelength  semiconductor  laser  and  a 0.633  pm  He-Ne  laser. 
Only  one  source  is  used  at  a time;  the  other  one  is  blocked.  The  visible 
He-Ne  laser  is  employed  for  alignment.  A chopper  and  a compensator 
are  inserted  at  position  A (B)  for  measurement  at  1.3  (0.633)  pm.  The 
compensator  adjusts  the  polarization  so  that  linearly  polarized  light 
can  be  launched.  The  microscope  objective  lens  OBJ1  is  used  for 
collimating  the  output  from  the  semiconductor  laser;  OBJ2  is  used  for 
coupling  into  the  fiber,  while  OBJ3  (Fig.  4-6(b))  is  used  for  coupling 
out  of  the  fiber  and  focusing  at  the  base  of  the  prism.  The  distance 
between  the  fiber  and  the  prism  is  about  50  cm  so  that  the  focused 
beam  is  almost  collimated  at  the  prism  base.  An  XYZ  transitional  stage 
(not  shown  in  the  figure)  is  used  to  adjust  the  position  of  the  coupling 
spot  on  the  prism  base  so  that  the  output  is  maximized  and  the 
position  of  the  coupling  spot  is  unchanged  as  the  prism  is  rotated.  The 
optimal  position  of  the  prism  and  the  coupling  spot  for  a collimated 
gaussian  beam  are  discussed  by  Ulrich  [Ulri70]  and  Tamir  [Tami79]. 
The  objective  OBJ4  images  the  output  at  a germanium  detector 
(DET1)  through  a slit.  The  signal  from  the  detector  is  amplified  and 
measured  by  a calibrated  power  meter  and  a lock-in  amplifier  is  used 
to  digitize  it.  Then  the  signal  is  sent  to  a microcomputer  through  an 
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Fig.  4-6.  Prism  coupler  measurement  setup,  (a)  Light  source  part.  The  light  is  launched  into 
a fiber  coupler  and  split  into  a power  monitor  and  measurement  part  denoted  as  C 
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Fig.  4-6  - continued,  (b)  Measurement  part.  The  light  output  from  the  fiber  is  focused  at  a 
prism.  The  signal  is  sent  to  a computer  through  a power  meter  and  lock-in 
amplifier. 
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RS-232C  interface.  In  figure  (a),  another  germanium  DET2  is  used  to 
monitor  for  laser  power. 

To  measure  the  angle  precisely,  the  stepping  motor  controller  is 
reset  to  zero  when  the  reflected  light  from  the  prism  front  surface 
passes  through  the  aperture  and  refocuses  on  the  fiber,  thus, 
establishing  the  reference.  The  angle  0t  shown  in  Fig.  4-5  can  be  read 
directly  from  the  stepping  motor  controller  and  corresponds  to  a peak 
in  the  power  detected  by  DET1.  0i  is  positive  if  the  incident  beam  lies 

between  the  normal  to  the  prism  and  its  base.  The  mode  index  is 
given  by  [Tami79] 


N = np 


Sin 


a + sin 


-i/sin9jl 
I nP  IJ  • 


(4.3) 


where  a is  the  angle  between  the  base  and  the  incident  face  of  the 
prism.  Three  factors  determine  the  accuracy  of  N:  0it  a,  and  np.  0j  can 
be  measured  accurately  to  within  a minute  in  radian  which 
corresponds  to  an  accuracy  of  10'4  in  the  mode  index.  The  other  two 
factors  a and  np  depend  on  the  precision  in  the  manufacturer’s 
specifications.  The  polarization  fluctuation  in  the  fiber  caused  by  the 
environment  may  reduce  the  measurement  accuracy  of  the 
birefringent  waveguides.  The  resolution  of  the  measurement  limits  the 
maximum  number  of  modes  measurable  in  a highly  multimode  graded- 
index  waveguide  where  the  difference  between  the  mode  indices 
becomes  smaller  for  higher-order  modes.  Therefore,  a chopper  and  a 
lock-in  amplifier  in  the  setup  are  recommended  to  decrease  the  noise 
and  improve  the  resolution. 
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Inverse  WKB  method 

As  described  in  the  last  chapter,  the  mode  indices  of  a graded- 
index  waveguide  can  be  evaluated  by  the  WKB  method.  On  the  other 
hand,  the  inverse  WKB  method  is  used  to  construct  the  index  profile 
from  given  mode  indices.  Since  the  number  of  measured  modes  is 
limited  as  pointed  out  in  the  last  subsection,  a continuous  index 
profile  can  not  be  exactly  obtained.  However,  the  method  provides 
some  useful  information  on  planar  waveguides  e.g.,  the  maximum 
index  change  Anmax  and  the  depth  of  the  index  profile.  Obviously,  the 
larger  the  number  of  modes,  the  higher  the  accuracy  in  the 
constructed  index  profile. 

The  characteristic  equation  of  a graded-index  waveguide, 
expressed  in  Eq.  (3.38),  can  be  rewritten  as 

fXm  ^ ,o 

k0  I [n2(x)-N^]1/2dx  = m7t+  E-  + & . 

Jo  4 2-  (4-4) 

We  have  assumed  that  <))s  and  <|)c  are  ^ and  ^ , respectively,  and  the 

index  profile  n(x)  is  monotonically  decreasing  from  the  surface.  For 
the  fundamental  mode  (m=0),  and  the  above  equation  becomes 

fx o 

k ojo  [n2(x)-N^]1/2dx  = 3a  ,4.5) 

Suppose  that  the  index  function  is  approximated  to  be  a linear 
function  between  n(x)=ns  and  n(x)=No: 

n(x)=ns-(^)x. 


(4.6) 
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where  the  maximum  index  at  the  glass  surface  is  ns  and  xo  is  the 
turning  point  of  the  fundamental  mode.  ns  is  determined  in  two  ways 
as  will  be  described  later.  Substituting  Eq.  (4.6)  into  Eq.  (4.5),  we 
obtain 


37t 

4kp 


(ns  - N0) 


nsVnf-N§  + Ng  log 


No  ||  ‘ 
ns  + Vni-Nf|/ 


(4.7) 


(4.8) 


The  first  turning  point  xo  can  be  evaluated  by  Eq.  (4.8).  For  the  first 
higher  mode  m=l,  Eq.  (4.4)  is  expressed  as 


Ztl 

4 


= ko 


rx  i 

= ko  I Vn2-N?dx 
JO 

fx 0 fX\ 

Vn2-N?dx  + I Vn2-N?d 

Jo  Jxn 


(4.9) 


where  xi  is  the  turning  point  for  the  first  higher  mode.  Since  the 
turning  point  for  the  fundamental  mode  is  found  by  Eq.  (4.8),  the  first 
integral  of  the  right  hand  side  of  Eq.  (4.9)  from  x=0  to  x=xo  can  be 
evaluated.  Again  a linear  approximation  is  used  to  interpolate  the  index 
profile  between  x=xq  and  xi  in  Eq.  (4.9)  as  follows: 


n(x)  - No  - ^ j (x-Xo)  for  x0<x<x1.  (4.10) 
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The  turning  point  xi  can  be  calculated  by  substituting  Eq.  (4.10)  into 
Eq.  (4.9).  Thus  n(x)  is  assumed  to  be  a piecewise  linear  function  with 
connected  points  at  xm.  The  procedure  is  repeated  for  other  turning 
points.  As  a result,  the  approximate  piecewise-linear  index  profile  can 
be  obtained. 

Errors  in  the  calculated  index  profile  may  be  induced  from  the 
estimation  of  ns  as  well  as  the  accuracy  in  the  measured  mode  indices. 
We  have  to  estimate  the  surface  index  precisely  to  obtain  an  accurate 
index  profile.  Two  methods  are  used  to  estimate  the  index  at  the 
surface  [Whit76,  Chia85].  The  value  of  ns  mainly  affects  the  accuracy  of 
the  lower  order  modes  and  not  the  higher  order  modes  [Whit76].  The 
index  profile  evaluated  from  incorrect  ns  may  have  very  large  error 
especially  near  the  surface.  Therefore,  White  and  Heidrich  [Whit76] 
proposed  a method  to  optimize  the  smoothness  of  the  index  profile  by 
evaluating  the  minimal  value  of  the  sum  of  the  triangular  areas  defined 
by  each  of  the  three  successive  turning  points  xt,  Xk+i,  and  Xk+2  and 
the  associated  mode  indices  Nk,  Nk+i,  and  Nk+2  (k=0,  1,...),  e.g., 

{(x=0,  ns),  (xq,  N0),  (xi,  Ni)},  {(xo,  N0),  (xi,  Ni),  (x2,  N2)} etc.  Note 

that  the  first  set  of  the  three  points  correspond  to  the  surface  index, 
the  fundamental  mode  index,  and  the  first  higher  mode  index.  For  a 
waveguide  with  five  modes,  there  are  four  sets  of  triangular  points. 
Since  the  number  of  modes  is  limited,  the  computing  time  for 
minimizing  the  total  area  is  short.  This  method  can  usually  give 
satisfactory  results  even  when  the  measured  mode  indices  are  not  very 
accurate.  This  minimal-area  technique  is  good  for  profiles  with  large 
variation  at  the  surface,  e.g.,  erfc  profile,  exponential  profile,  etc. 
Chiang  [Chia85]  discussed  an  extrapolation  technique  to  estimate  the 
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surface  index  ns  by  a given  set  of  measured  mode  indices  and  the 
corresponding  mode  orders.  Although  Eq.  (4.4)  is  valid  for  non- 
negative integers  (m)  only,  it  is  allowed  to  have  non-integral  values  for 
prediction  of  m which  gives  the  hypothetical  turning  point  at  the 
surface  with  the  maximum  index  ns.  Suppose  that  the  upper  limit  of 
the  integral  on  the  LHS  of  Eq.  (4.4)  is  at  the  surface  x = 0.  In  this  case 
the  LHS  equals  zero  and  m = -0.75,  indicating  that  the  hypothetical 
mode  having  the  turning  point  at  the  surface  should  have  m = -0.75. 
Thus,  the  index  at  the  surface  can  be  obtained  by  extrapolating  the 
given  sets  of  points  with  the  integer  mode  order  m and  the  mode 
index  Nm  to  m = -0.75.  An  effective -index  profile  N(m)  as  a function  of 
m can  be  constructed  by  inter-  and  extra-polating  the  points  between 
the  measured  points  i.e.,  m=0,  1,  2,...  . With  N(m)  a nearly  continuous 
function,  a smooth  index  profile  n(x)  can  be  obtained  by  using  the 
procedure  described  earlier  in  this  subsection.  However,  the 
extrapolation  may  not  give  an  accurate  ns,  resulting  in  a round-off 
error  in  the  index  profile  at  surface.  Therefore,  this  extrapolation 
technique  is  good  for  evaluating  ns  in  those  profiles  which  show 
slower  variations  near  the  surface,  such  as  gaussian  or  parabolic  index 
profiles.  Usually,  consistent  results  are  obtained  by  the  two  methods 
above  in  the  case  of  Ag+-Na+  exchange  in  BK7  glass.  The  predicted 
index  profiles  are  usually  more  accurate  in  the  region  where  the  index 
is  close  to  the  substrate  index.  Hence,  the  width  of  the  reconstructed 
index  profile  can  be  precisely  determined  by  the  inverse  WKB  method 
using  either  of  the  two  methods. 
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4.2.2.  Experimental  Procedure  to  Determine  the  Diffusion  Coefficient 

The  silver  diffusion  profile  of  the  waveguide  can  be  analyzed  using 
an  atomic  absorption  spectrophotometer  (AAS)  [Chlu87],  an  electron 
microprobe  [Zhen89],  or  a scanning  electron  microscope  (SEM) 
[Rama86,  Lagu86a].  Alternatively,  multimode  planar  waveguides  can  be 
characterized  optically  by  measuring  the  mode  indices  with  a prism 
coupler  as  described  earlier  and  the  index  profile  (concentration 
profile)  can  be  constructed  by  the  inverse  WKB  method.  Figure  4-7 
shows  the  normalized  concentration  (index)  profile  measured  by 
various  techniques  for  NAg=2xlO"4  MF  in  the  soda-lime  glass 
demonstrating  excellent  agreement  in  the  results.  As  mentioned  in 
chapter  2,  a complementary  error  (erfc)  function  (solid  line  in  the 
same  figure)  is  obtained  for  low  concentration  of  the  silver  ion  in  the 
glass.  The  width  (Weff)  of  the  profile  at  0.1573  points,  yielded  from 
the  comparison,  equals  2VmDAt  (section  2.2.3),  where  t is  the  given 
diffusion  time  and  m is  taken  to  be  1.49  in  this  glass[Rama88d].  Thus, 
hiDa  is  evaluated  by  (Weff/2)2/t,  which  is  concentration-independent 
for  low  silver  concentration  in  glass.  The  double  alkali  effect  is 
neglected  in  this  case. 

The  waveguides  which  support  one  or  two  modes  have  either  a 
small  depth  or  a low  concentration  of  silver  ions  (small  An).  In  this 
case  the  silver  concentration  profile  determined  by  the  above  methods 
is  somewhat  inaccurate  due  to  the  resolution  of  the  measurements. 


Ag+-Na+  Exchange  in  SLS 
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Fig.  4-7.  Normalized  silver  concentration  profile  in  soda-lime  glass.  The  measured  data 
points  excellently  fit  to  the  erfc  function  (solid  line)  for  concentration  0.1. 
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The  inverse  WKB  is  also  not  applicable  since  one  or  two  modes  are 
insufficient  to  construct  a profile.  In  this  case  computer  simulation 
[Rama88a,  Chen90a]  is  used  to  model  the  index  profile.  First,  the  1-D 
nonlinear  diffusion  equation  (Eq.  2.22)  is  solved  by  the  finite 
difference  method  to  obtain  the  index  profile.  The  unknown 
parameters  used  in  the  calculation  are  the  self-diffusion  coefficients  of 
silver  and  sodium  ions  and  NAo.  the  mole  fraction  of  the  silver  ions  at 
the  surface  (see  Eq.  (2.24)).  The  parameters  are  substituted  by  a set  of 
guessed  values.  The  calculated  index  profile  is,  then,  used  in  solving 
the  Helmholtz  equation  to  attain  the  mode  indices  by  the  finite 
difference  method.  In  this  step  the  unknown  parameter  is  the 
maximum  index  change  at  surface  of  the  waveguide.  The  index  at  the 
surface  is  assumed  to  be  the  same  as  that  of  a highly  multimode 
waveguide.  The  solved  eigen  values  (mode  indices)  are  compared  to 
the  mode  indices  measured  by  the  prism  coupler.  If  the  mode  indices 
do  not  match,  a new  set  of  the  guessed  parameters  are  used.  By  trial- 
and-error,  we  thus  fit  the  calculated  mode  indices  to  the  measured 
ones.  Once  they  are  matched,  the  self-diffusion  coefficient  of  the  silver 
ions  is  obtained. 

Figures  4-8(a)  and  (b)  show  the  normalized  index  profile  for  the 
one-step  diffusion  without  and  with  an  applied  field,  respectively.  The 
solid  lines  represent  the  theoretical  curves  and  the  discrete  points 
denote  the  measured  mode  indices  at  the  positions  calculated  by  the 
inverse  WKB  method.  The  ion  exchange  of  Ag+  with  Na+  in  soda-lime 
silicate  glass  was  carried  out  at  330°C  with  2X1  O'4  MF  AgN03  in 
NaN03  mixture.  The  silver  concentration  in  the  glass  is  less  than  0.1 
MF.  The  index  profile,  shown  in  Fig.  4-8(a),  is  expected  to  be  similar 
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Fig.  4-8.  Fitting  of  mode  indices  by  the  inverse  WKB  method.  The  discrete  points  are  evaluated 
by  the  inverse  WKB  method  from  measured  mode  indices.  The  solid  lines  are  the 
analytical  results,  (a)  Diffusion  without  applied  field. 
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Fig.  4-8  - continued,  (b)  Diffusion  with  applied  field. 
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to  the  erfc  function  for  diffusion  without  the  external  field.  There  are 
five  modes  in  this  waveguide  with  a diffusion  time  of  120  minutes.  The 
diffusion  coefficient  is  found  to  be  0.15  pm2/min  in  this  case.  The 
other  curve  in  Fig.  4-8(b),  showing  zero  slope  at  the  surface,  is  the 
case  of  an  applied  field.  The  diffusion  is  performed  for  30  minutes 
under  an  electric  field  of  45  V/mm.  Five  modes  are  observed  in  this 
waveguide.  The  best  fit  is  obtained  with  DA=0.04  pm2/min  and  pA=5.0 
pm  /V*min.  For  the  same  temperature  the  diffusion  coefficient  is 
smaller  for  the  case  of  the  applied  field.  The  reason  is  that  the  space- 
charge  effect  is  enhanced  by  the  applied  field  since  the  mobilities  of 
silver  and  sodium  ions  are  not  equal.  As  discussed  previously  in 
chapter  2,  the  modeling  assumes  that  the  space-charge  density  is 
negligible  and,  therefore,  the  diffusion  coefficient  obtained  by  the 
theory  is  not  the  correct  one.  However,  for  the  case  of  the  diffusion 
without  an  applied  field,  the  fitting  method  yields  accurate  results  as 
will  be  demonstrated  in  the  next  chapter. 

4-2.3.  2-D  Mode  Field  and  Loss  Measurement 

Mode  field  profile  measurement  is  of  paramount  importance  in 
characterizing  single-mode  waveguides  and  devices.  The  mode  field  of 
the  guide  determines  the  fiber-waveguide  coupling  loss,  the 
performance  of  the  directional  coupler,  etc.  The  light  source  setup  is 
the  same  as  shown  in  Fig.  4-6(a).  However,  the  difference  in  this  setup 
is  that  the  light  from  the  fiber  coupler  is  launched  into  the  waveguide 
directly,  as  shown  in  Fig.  4-9.  A single-mode  fiber  is  used  to  couple  the 
light  into  either  the  TE  or  TM  mode  of  the  waveguide.  The 
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Fig.  4-9.  The  nearfield  measurement  setup.  The  light  source 
part  C is  shown  in  Fig.  4-6(a). 
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polarization  of  the  light  is  achieved  by  adjusting  the  compensator  at 
position  A or  B (Fig.  4-6(a))  so  that  the  linearly  polarized  light  is 
obtained  at  output.  The  fiber  must  be  kept  as  steady  as  possible  to 
prevent  polarization  fluctuations  during  the  measurement.  The 
waveguide  output  is  collected  through  a high-quality  planar  objective 
(OBJ3)  by  imaging  the  output  on  a germanium  detector  (DET1)  with  a 
10  pm  pinhole  mounted  in  front  of  the  detector. 

The  2-D  near-field  intensity  pattern  is  measured  by  scanning  the 
detector-pinhole  assembly  horizontally  and  vertically  by  the  stepping 
motor.  Notice  that  the  waveguide  image  is  inverted  by  OBJ3  and 
magnified  by  about  65  times.  The  signal  from  DET1  is,  then,  sent  to  a 
lock-in  amplifier  through  a power  meter/current  amplifier.  A 
computer  is  interfaced  with  the  lock-in  amplifier  to  transfer  the  data 
from  the  amplifier.  Often  more  than  500  points  are  sampled  in  each 
scan  of  about  1200  pm.  Since  the  diameter  of  the  pinhole  is  10  pm,  an 
unnecessarily  large  number  of  points  does  not  increase  the  resolution 
of  the  measured  intensity  profile.  Furthermore,  too  many  data  points 
may  occupy  a large  space  in  storage  disk  and  slow  down  the  data 
processing  speed.  However,  the  averaging  of  a sufficient  number  of 
sampling  points  may  smooth  out  the  random  noise  in  the  signal. 

The  Fabiy-Perot  effect  caused  by  the  Fresnel  reflections  at  the 
fiber-waveguide  gap  can  be  observed  by  the  output  power  oscillation 
when  adjusting  the  gap  separation.  Maximum  transmission  is  obtained 
when  the  gap  separation  equals  an  odd  number  of  quarter  wavelengths 
since  the  index  of  the  substrate  is  similar  to  that  of  the  fiber.  This 
requires  submicron  reproducibility  of  the  transitional  stage.  The 
stability  of  the  holders  of  the  single-mode  fiber,  waveguide,  and 
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objective  OBJ3  is  also  important  for  reproducible  measurements.  A 
movement  of  1 pm  in  the  waveguide  or  OBJ3  will  be  magnified  at  the 
detector  and  may  result  in  a shift  of  65  pm  in  the  image  since  the 
magnification  factor  is  65.  By  removing  the  waveguide,  the  near-field 
intensity  profile  of  the  single-mode  fiber  is  also  measured  using  the 
same  setup.  The  horizontal  and  vertical  profiles  of  the  circularly 
symmetrical  fiber  must  be  the  same  and  they  are  symmetrical  if  the 
fiber  face  is  carefully  prepared. 

The  magnification  of  the  system  is  determined  as  follows.  Since 
there  are  several  single-mode  waveguides  on  the  same  substrate  and 
the  separation  of  the  straight  channel  waveguides  is  known,  the 
magnification  is  measured  by  first  aligning  the  detector  at  the 
maximum  intensity  when  the  fiber  launches  light  into  a waveguide 
(WG1)  and  then  displacing  the  fiber  laterally  across  the  substrate  to 
the  next  waveguide  (WG2)  and  measuring  the  distance  through  which 
the  pinhole  has  to  be  translated  to  collect  the  light  from  WG2.  The 
magnification  factor  is  obtained  by  dividing  the  translation  distance  of 
the  pinhole  by  the  separation  between  WG1  and  WG2.  After 
determining  the  magnification  factor,  the  width  and  depth  intensity 
profiles  are  obtained.  The  near-field  profile  is  calculated  by  taking  the 
square  root  of  the  intensity  profile. 

The  loss  of  the  waveguide  can  be  measured  by  the  same  setup 
without  the  pinhole.  First,  the  power  output  (Pin)  from  the  single- 
mode fiber  is  measured.  Next,  the  waveguide  is  inserted  between  the 
fiber  and  OBJ3.  The  power  (Pout)  is,  then,  measured  by  maximizing 
the  fiber-waveguide  coupling.  In  this  case,  in  addition  to  the  coupling 
and  waveguide  propagation  losses,  only  the  reflection  at  the  waveguide 
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output  end  contributes  to  the  insertion  loss.  The  reflection  of  the 
objective  OBJ3  is  the  same  during  the  loss  measurement  with  and 
without  the  waveguide  and,  hence,  it  is  not  considered  in  the  loss 
measurement.  The  total  insertion  loss  of  the  waveguide  equals  10 
l09l0(Pout/Pin)  (dB). 

As  pointed  out  earlier,  the  total  insertion  loss  consists  of  three 
primary  loss  mechanisms:  1)  Fresnel  loss  due  to  reflection  of  the 
optical  field  at  the  waveguide  output  end,  2)  mode  mismatch  loss  due 
to  the  difference  in  mode  fields  of  the  single-mode  fiber  and  channel 
waveguide,  and  3)  propagation  loss  resulting  from  the  surface 
scattering  and  absorption  in  the  waveguide.  The  Fresnel  loss  of  the 
waveguide  due  to  the  glass-air  interface  is  determined  by  the  substrate 
index.  By  assuming  normal  incidence  and  a small  An,  the  loss  is 
estimated  to  be  10  log[4nt>/(nb+l)2]  = -0.18  dB  for  BK7  glass  nb  = 
1.50371  at  1.3  pm  wavelength.  The  mode  mismatch  loss  is  calculated 
by  maximizing  the  overlap  integral: 
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where  Eflx.y)  and  Eg(x,y)  are  the  mode  field  profiles  of  the  fiber  and 
waveguides,  respectively.  The  mode  mismatch  loss  is  given  by 
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10  logio(ri)  {dB}.  Figure  4-10  shows  the  measured  near-field  intensity 
profile  for  the  fiber  (solid  line)  and  a surface  waveguide  in  the 
horizontal  (parallel  to  the  waveguide  surface,  denoted  as  V)  and 
vertical  (normal  to  the  surface,  denoted  as  ‘o’)  directions.  It  is  clearly 
seen  that  due  to  the  asymmetry  in  the  index  profile  the  mode  field 
intensity  profile  in  the  vertical  direction  is  asymmetrical  and  has 
narrow  width  compared  to  the  symmetrical  profile  in  the  horizontal 
direction.  The  mismatch  loss  is  0.22  dB  calculated  by  Eq.  (4.11)  in 
this  case. 

Once  the  reflection  and  mode  mismatch  losses  are  determined, 
the  propagation  loss  of  the  waveguide  can  be  obtained  by  subtracting 
them  from  the  overall  fiber-waveguide  insertion  loss.  The  lowest 
propagation  loss  of  a single-mode,  silver-exchanged  surface-waveguide 
device  in  BK7  glass  achieved  to  date  is  0.15  ±0.1  dB/cm  [Chen90b].  If 
the  propagation  loss  is  below  0.1  dB/cm,  then  the  technique 
described  above  is  not  adequate.  Instead  a resonant  structure,  e.g., 
Fabry-Perot  interferometer,  ring  resonator  [Eguc83,  Conn87],  etc. 
must  be  used  for  an  accurate  loss  measurement. 

The  reflection  loss  is  usually  inevitable  without  using  an  index 
matching  fluid  and,  thus,  it  imposes  a fundamental  lower  limit  on  the 
insertion  loss.  The  mode  mismatch  loss  can  be  reduced  by  adjusting 
the  maximum  index  change  and  the  fabrication  condition  or  even 
burying  the  waveguide  with  a two-step  diffusion  process  [Kane85, 
Lagu86a,  Kane86,  Rama86,  Ross86,  Chen88].  Recently,  Pantchev 
[Pant87],  Zhenguang  et  al.  [Zhen88],  Gunther  and  Jestel  [Gunt88],  and 
Gevorgyan  [Gevo90]  demonstrated  novel  single-step  diffusion 
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Fig.  4-10.  Near-field  intensity  profiles  of  2-D  silver-exchanged  surface  waveguide  and 
Fiber.  The  mismatched  loss  is  0.22  dB  between  the  waveguide  and  fiber. 
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processes  to  fabricate  buried  waveguides  with  an  applied  field.  Such 
developments  may  be  necessary  to  simplify  the  process  and,  as  a 
result,  reduce  the  cost  of  fabricating  ion-exchanged  waveguides. 

4.3.  Summary 

In  this  chapter  the  fabrication  processes  for  1-D  and  2-D  Ag+- 
Na+  exchanged  waveguides  were  described  in  detail.  The 
characterization  of  1-D  waveguides  by  the  prism  coupler  measurement 
and  inverse  WKB  method  allows  determination  of  the  diffusion 
characteristics  and  the  associated  index  change.  Characterization  of 
the  channel  waveguides  including  the  mode  field  and  the  loss 
measurement  were  also  addressed.  These  characterization  techniques 
are  used  in  the  simulation  of  the  tapered  waveguides  in  the  next 
chapter. 


CHAPTER  5 

MODELING  OF  TAPERED  WAVEGUIDES 
5.1.  Introduction 

Tapered  channel  waveguides  are  extremely  useful  in  integrated 
optical  devices.  Often  they  are  used  as  interconnections  between  a 
single-mode  fiber  and  a waveguide  device  with  different  mode  sizes  to 
reduce  the  mode  mismatch  loss  since  the  tapered  guide  provides  a 
smooth  transition  between  them.  In  addition,  an  efficient  short  taper 
design  permits  the  use  of  small  devices  resulting  in  an  increased 
packing  density.  Suchoski  and  Ramaswamy  [Such87]  reported  on 
tapers  fabricated  in  Ti:LiNb03.  Mahapatra  and  Connors  [Maha88a] 
demonstrated  a low  loss  thermal  taper  using  Ag+-Na+  exchange  in 
Corning  0211  glass.  Recently,  Zhenguang  et  al.  [Zhen89]  achieved  low 
loss,  single-mode  waveguides  with  near  adiabatic,  fiber-compatible 
tapers  at  both  ends  in  BK7  glass,  which  is  an  important  step  before 
glass  waveguides  become  useful  in  passive  system  applications.  The 
understanding  of  the  evolution  of  the  index  change,  channel  (guide) 
size,  modal  field,  and  propagation  constant  is  of  paramount 
importance  in  the  design  of  low  loss  tapers.  A model  that  tracks  the 
above  parameters  as  a function  of  the  taper  length  and  fabrication 
conditions  is  essential  to  gain  a complete  understanding  of  the 
performance  of  the  tapered  devices. 
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In  this  chapter,  we  first  present  a systematic  study  of  modeling 
of  the  planar  waveguides,  followed  by  a model  for  straight  channel 
waveguides  and  tapered  transitions  in  BK7  glass  [Chen89b,  Chen90a]. 
Using  1-D  multimode  planar  waveguides,  we  determine  the  diffusion 
coefficient,  index  profile,  and  surface  index  change.  The  calculated 
diffusion  coefficient  and  estimated  index  change  are  in  good 
agreement  with  the  measured  values.  These  parameters  are  employed 
for  the  2-D  modeling  of  straight  channel  waveguides  of  different 
widths.  Again,  the  results  are  compared  to  the  experimental  data  to 
ensure  the  accuracy  of  the  modeling. 

The  first  step  of  the  taper  fabrication  process  is  similar  to  that 
described  in  the  last  chapter  (Fig.  5- 1(a)).  In  the  second  step,  which 
we  call  postbaking,  thermal  tapers  are  formed  by  nonuniform  diffusion 
under  a thermal  gradient  along  the  waveguide  direction  since  the 
diffusion  coefficient  is  a function  of  temperature,  as  shown  in  Fig.  5- 
1(b).  Therefore,  the  tapered  waveguide  dimension  in  the  propagation 
direction  can  be  controlled  by  varying  the  temperature  along  this 
direction.  The  thermal  gradient  causes  the  maximum  index  change  at 
the  surface  to  decrease  and  the  size  of  the  waveguides  to  increase 
along  the  taper.  In  order  to  analyze  the  tapered  waveguide,  the  taper  is 
sectioned  longitudinally  with  its  own  distinct  waveguide  parameters  in 
each  section  as  a 2-D  channel  waveguide.  The  index  change  is  assumed 
to  be  proportional  to  the  silver  concentration,  and  the  negligible 
birefringence  observed  in  these  guides  [Zhen89]  demonstrates  that 
they  are  nearly  stress  free  [Bran86].  Hence,  the  index  changes  at  the 
surface  are  the  same  for  both  the  TE  and  TM  modes.  The  2-D 
nonlinear  diffusion  equation  Eq.  (2.21)  for  the  ion  exchange  and  the 
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(a)  straight  channel  waveguide 
(by  ion-exchange) 


(b)  tapered  waveguide 

(by  postbaking  the  straight  guide) 


taper 

retflnn 


Fig.  5-1.  Schematic  diagram  of  (a)  a straight  channel  waveguide 

fabricated  by  a single  step  binary  ion  exchange  and  (b)  a tapered 
guide  by  postbaking  a straight  channel  under  a thermal  gradient. 
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2-D  Helmholtz  wave  equation  Eq.  (3.36)  are  numerically  solved  by  FDM 
as  discussed  in  chapter  3.  An  Arrhenius  type  relation,  Eq.  (2.40),  is 
used  to  represent  the  self-diffusion  coefficient  of  silver  ions  for  the 
nonuniform  temperature  diffusion.  The  results  show  that  the 
calculated  index  profile  and  mode  sizes  along  the  taper  agree  very  well 
with  the  measured  values  for  the  cases  of  straight  channel  waveguides 
as  well  as  the  tapers.  The  agreement  shows  that  the  model  can  be 
used  to  predict  the  evolution  of  the  index  profile,  propagation 
constant,  and  size  of  the  modal  field  in  single  step  as  well  as  two-step 
ion-exchanged  waveguides.  The  radiation  loss  of  the  tapered 
waveguides  is  evaluated  by  BPM  for  various  taper  profiles,  e.g.,  linear, 
quadratic,  raised-sine  etc.  as  functions  of  taper  length  in  the  z 
direction  [Chen89a].  It  is  also  useful  in  predicting  the  mismatch  loss 
in  a coupler  between  devices  using  tapered  transition. 

5.2.  Modeling 

As  shown  in  Fig.  5-2,  the  modeling  includes  three  parts:  the 
modeling  of  planar  waveguides,  channel  waveguides,  and  tapered 
waveguides.  Simulations  and  experiments  were  done  simultaneously  to 
ensure  the  accuracy  of  the  modeling. 

5.2.1.  Planar  Waveguides 

The  multimode  planar  waveguide  is  modeled  first,  to  get  the 
fitting  parameters,  which  are  used  in  the  simulation  of  the  channel 
and  tapered  waveguides  as  shown  in  Fig.  5-2.  The  1-D  index  profile  of 
the  planar  guide  in  the  x-direction  is  obtained  by  first  solving  the  1-D 
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Flow  Chart  for  the  Study  of 
Ion-Exchanged  Guides 


^simulations  (^experiments 


Fig.  5-2.  Flow  chart  of  the  modeling  of  planar,  channel,  and  tapered 

waveguides  that  includes  the  simulation  and  correlation  with 
experimental  results. 


146 


nonlinear  diffusion  equation  Eq.  (2.22)  by  FDM.  The  initial  and 
boundary  conditions  are  expressed  by  Eq.  (2.24).  However,  a close- 
form  solution  is  not  available  in  this  case.  The  fitting  parameters  are  a, 
Nao.  Dao,  and  AH  for  1-D  waveguides.  NAo  depends  on  the  ion- 
exchange  equilibrium  condition  [Rama88d]  at  the  glass  surface.  To 
choose  appropriate  values  of  a and  NAo  for  the  initial  fitting 
parameters,  we  first  analyzed  the  index  profiles  which  were 
reconstructed  by  the  inverse  WKB  method  from  the  mode  indices  of 
multimode  waveguides  measured  by  the  prism  coupler  technique  as 
described  in  the  last  chapter.  The  magnitude  of  aNA0  determines  the 
slope  of  the  index  function  at  the  surface.  The  initial  values  of  NA0  and 
a for  the  fitting  of  the  calculated  and  experimental  index  profiles  can 
be  estimated  from  the  slope  of  the  rebuilt  index  profile  obtained  using 
the  inverse  WKB  method,  so  that  the  fitting  parameters  NA0  and  a are 
readily  obtained.  The  self-diffusion  coefficient  DA  dictates  the  width  of 
the  index  profile  or  the  depth  of  the  waveguide.  Thus,  the  technique 
involves  choosing  NA0  and  a first  and  varying  DA  until  the  depth  of 
diffusion  is  matched.  Then  NA0  and  a are  adjusted  again  to  achieve  the 
best  fit  of  the  index  profile.  Figure  5-3  shows  both  the  calculated  index 
profile  (solid  line)  and  the  experimental  profile  (cross-points)  in  BK7 
glass.  The  diffusion  was  performed  for  48  hours  at  330°C  in  a melt  of 
0.33  MF  KN03,  0.67  MF  NaN03,  and  0.001  MF  AgN03.  The  best  fitting 
values  of  NA0  and  a are  found  to  be  0.9  and  0.99,  indicating  that  the 
self-diffusion  coefficient  of  the  silver  ions  is  two  order  of  magnitude 
less  than  that  of  sodium  ions.  The  surface  index  change  Anmax,  which 
is  linearly  proportional  to  NAo,  is  estimated  to  be  0.02  at  0.633  pm 
wavelength  for  the  above  melt  composition.  The  cross-points  denote 
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Fig.  5-3.  Index  profiles  of  a multimode  planar  waveguide  reconstructed  by  an  inverse 
WKB  method  (cross-points)  from  the  prism-coupler  measurement  and  solved 
by  FDM  (solid-line).  The  fabrication  condition  is  described  in  the  text. 
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the  reconstructed  profile  of  a multimode  planar  waveguide  using  the 
inverse  WKB  method  employing  the  measured  mode  indices.  For  this 
case,  the  self-diffusion  coefficient  of  silver  ions  in  BK7  glass  at  330°C 
was  estimated  to  be  2.9X1  O'3  pm2/min.  The  results  show  a gaussian- 
like  profile  for  the  silver  exchanged  waveguide  in  BK7  at  this 
concentration.  Several  multimode  waveguides  were  fabricated  at 
different  diffusion  temperatures,  and  in  each  case  the  self-diffusion 
coefficient  (Da)  was  evaluated  using  the  above  procedure,  resulting  in 
the  temperature  dependence  of  DA.  Figure  5-4  shows  the  calculated 
self-diffusion  coefficient  of  silver  ions  in  BK7  as  a function  of  the 
diffusion  temperature  for  the  above  melt.  The  coefficients  DA0  and  AH 
in  the  Arrhenius  relation  Eq.  (2.40)  are  found  to  be  3.705X1 06 
pm  /min  and  105.1  kJ/mole,  respectively.  The  activation  energy  is 
close  to  that  reported  by  Eguchi  et  al.,  98.7  kJ/mole  [Eguc83]. 

Multimode  planar  waveguides  allow  the  determination  of  NA0,  a, 
DA0,  and  AH.  These  values  are  used  for  channel  simulation  as  shown  in 
Fig.  5-2.  The  self-diffusion  coefficient  of  Ag+  in  channel  waveguides  is 
assumed  to  be  the  same  as  that  of  the  planar  waveguides. 

5.2.2.  Channel  Waveguides 

The  2-D  diffusion  equation  is  also  numerically  solved  by  FDM. 
However,  the  boundary  conditions  are  modified  to  accommodate  the 
diffusion  through  the  mask  opening.  In  the  mask  opening  region,  the 
concentration  of  cation  A at  the  surface,  is  the  same  as  in  the  planar 
waveguide  case,  namely,  NAq.  Under  the  mask,  the  concentration 
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Fig.  5-4.  Self-diffusion  coefficient  DAg  4im2/min)  of  silver  ion 
in  BK7  as  a function  of  temperature. 
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gradient  of  ion  A is  set  to  zero  in  the  vertical  direction  at  the  air-glass 
interface: 


3Na 
d x 


[x=0 


= 0. 


(5.1) 


The  numerical  accuracy  of  the  results  is  checked  by  comparing  with  a 
1-D  analytical  solution,  which  is  a complementary  error  function  for 
a=0  as  discussed  in  chapter  2.  The  index  profile,  obtained  from  the 
diffusion  simulation  for  channel  waveguides  by  using  the  parameters 
(DAo  and  AH)  from  our  1-D  modeling,  is  compared  with  the 
concentration  profile  of  single-mode  channel  waveguides,  obtained  by 
an  electron  microprobe,  as  outlined  in  the  flow  chart  in  Fig.  5-2. 

The  2-D  Helmholtz  equation  Eq.  (3.36)  is  solved  by  FDM  with 
the  assumption  that  the  field  vanishes  at  the  outer  calculation 
boundary.  The  separation  between  the  outer  boundaries  in  both  the 
horizontal  and  vertical  directions  is  chosen  to  be  much  larger  than  the 
widths  of  the  mode  fields  so  that  the  above  assumption  is  valid.  Finally, 
the  mode  fields  of  the  channel  waveguides  calculated  by  FDM  are 
compared  to  the  near-field  measurements.  In  the  experiment, 
considerable  effort  was  put  into  fabrication  of  the  single-mode 
waveguides  since  they  have  wide  applications  in  the  optical 
communication  system.  Hence,  in  the  simulation  of  channel  and 
tapered  waveguides,  we  focus  only  on  the  fundamental  mode. 

5.2.3.  Tapered  Waveguides 

In  the  simulation  of  the  second  step,  i.e.,  postbaked  waveguides, 
the  first  step  concentration  of  ion  A is  used  as  the  input.  The 


151 


concentration  gradient  of  ion  A in  the  vertical  direction  at  the  air-glass 
interface  is  set  to  zero,  as  before,  in  the  region  under  the  mask.  Since 
the  width  of  the  waveguide  becomes  larger  after  postbaking  and  the 
total  number  of  cations  A remains  constant,  the  maximum 
concentration  of  ions  at  the  surface  decreases  with  the  maximum 
index  change  still  occurring  at  the  surface.  As  described  previously, 
the  tapered  guide  was  sectioned  along  the  longitudinal  direction  for 
modeling  purposes  with  each  section  having  a lower  Anmax  and  larger 
waveguide  size  than  the  straight  channel  waveguide  depending  on  the 
temperature  during  postbaking  (and  hence  location)  along  the 
waveguide  direction.  The  same  procedure  outlined  previously  for  the 
channel  waveguide  was  used  to  simulate  each  section  making  up  the 
taper.  The  simulation  results  of  the  two-step  process  are  again 
checked  by  the  electron  microprobe  data  (procedure  indicated  in  the 
flow  chart  in  Fig.  5-2).  As  before,  the  2-D  Helmholtz  equation  was 
solved  to  determine  the  mode  fields  and  mode  indices  in  each  section. 

5.2.4.  Results  and  Comparison  with  Experiment 

In  this  subsection  we  present  the  simulation  results  on  Ag+-Na+ 
exchanged  waveguides  and  tapered  transitions  and  compare  them 
with  the  experimental  results.  The  full  widths  of  the  index  profile  at 
1/e  points  in  the  horizontal  and  vertical  directions  represented  by  dy 
and  dx,  respectively,  are  shown  in  Fig.  5-5  as  a function  of  the  mask 
width  WM  for  straight  surface  channels.  The  surface  channels  are 
made  at  340°C  for  12  hours  with  10'3  MF  silver  nitrate  in  the  melt. 
The  open  rectangles  □ show  the  index  profile  widths  measured  by  the 
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Fig.  5-5.  1/e  width  (dy,  dashed  line)  and  1/e  depth  (dx,  solid  line)  of  the  index 
profile  vs.  the  mask  opening.  The  open  squares  with  the  error  bars 
are  measured  values  using  an  electron  microprobe. 
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electron  microprobe  for  mask  openings  Wm  = 3,  5,  and  6 pm  [Zhen89]. 
The  electron  microprobe  data  are  not  as  accurate  as  is  desired  due  to 
the  destructive  nature  of  the  measurements.  In  other  words,  if  the 
width  of  the  profile  is  measured,  the  depth  can  not  be  measured 
accurately.  This  error  decreases  with  increasing  mask  width  as  shown 
by  the  error  bars  in  the  figure.  Figure  5-6  shows  the  lateral  diffusion 
defined  as  (dy  - Wm)  vs.  Wm  and  represented  by  the  dashed  line.  The 
solid  line  shows  the  diffusion  depth  of  the  surface  channels 
normalized  to  the  depth  of  the  planar  waveguide  dp  fabricated  under 
identical  conditions.  The  lateral  diffusion  tends  to  saturate  at  ~4  pm 
for  mask  openings  larger  than  3 pm.  Therefore,  dy  increases 
proportionally  with  Wm  so  that  (dy  - Wm)  saturates  for  Wm  > 3 pm.  As 
seen  in  Fig.  5-6,  dx/dp  is  limited  by  the  mask  opening  and  approaches 
a value  of  only  0.75  for  Wm  = 6 pm.  (The  waveguides  are  multimode  for 
Wm  > 6 pm.)  The  variation  of  the  full  widths  of  the  mode  fields  at  1/e 
intensity  points  in  the  horizontal  and  vertical  directions,  Wh  and  Wv, 
respectively,  is  shown  in  Fig.  5-7.  The  discrete  points  represent  values 
obtained  by  the  near-field  measurement.  The  aspect  ratio  defined  as 
Wh/Wv  is  usually  larger  than  one  for  surface  waveguides.  The  mode  size 
increases  at  small  mask  openings  since  the  guides  are  near  cutoff.  Wv 
decreases  monotonically  and  approaches  the  mode  size  of  the 
corresponding  planar  waveguide  as  Wm  increases.  Wh  first  decreases 
and  increases  again  for  wider  mask  openings,  which  shows  that  the 
modes  are  near  cutoff  for  narrower  channels  and  well  confined  for 
wider  channels  in  the  horizontal  direction. 

The  results  described  so  far  give  insight  into  designing  tapers  by 
varying  the  mask  width  only.  The  channels  become  multimoded  for 
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Fig.  5-6.  Lateral  diffusion  (dy-WM,  dashed  line)  and  normalized  depth  (dx/dP,  solid  line) 

as  functions  of  the  mask  opening  Wm,  where  dP  is  the  1/e  diffusion  depth  of  the 
surface-planar  waveguide  fabricated  in  the  same  conditions. 
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Fig.  5-7.  1/e  widths  of  the  inode  fields  in  the  horizontal  direction  (Wh,  dashed  line)  and  the 
vertical  direction  (Wv.  solid  line)  as  a function  of  mask  opening.  The  open  squares 
are  data  points  obtained  by  the  near  field  measurement. 
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Wm  > 6 pm,  which  implies  that  a single  mode  waveguide  with  tapered 
mode  size  in  the  longitudinal  direction  is  difficult  to  make  in  one  step 
by  varying  only  the  mask  opening.  Tapered  waveguides  fabricated  by 
just  varying  the  mask  width  are  expected  to  approach  cutoff  at  the 
narrow  end  or  become  multimoded  at  the  wide  end.  Figure  5-8  shows 
the  diffusion  depth  dx  as  a function  of  the  diffusion  time  in  the  first 
step  process  for  the  planar  guide  as  well  as  for  channels  of  WM  = 8 and 
3 pm.  The  processing  parameters  are  the  same  as  those  for  the  data 
presented  in  Figs.  5-5  to  5-7,  except  that  the  diffusion  time  is  allowed 
to  vary  now.  The  diffusion  times  of  single-mode  operation  with  8 and 
3 pm  mask  openings  are  between  ~90  and  230  min  and  180  and 
830  min,  respectively.  The  diffusion  time  for  the  single-mode 
waveguide  with  WM  =8  pm  requires  more  accurate  control  than  that  of 
the  channel  with  Wm  = 3 pm.  Single-mode  tapered  waveguides  with 
WM  from  3 pm  to  8 pm  in  the  longitudinal  direction  can  be  obtained  by 
diffusing  the  silver  ions  in  the  interval  between  180  and  230  min.  For 
mask  opening  covering  more  than  this  range,  the  control  of  diffusion 
is  even  more  critical  to  fabricate  single-mode  tapered  waveguides.  If 
the  narrow  end  of  the  tapered  waveguide  is  close  to  cutoff,  the  mode 
size  may  not  monotonically  increase  along  the  longitudinal  direction 
even  though  the  waveguide  is  single  mode.  Furthermore,  the 
scattering  loss  is  expected  to  be  larger  for  the  waveguides  or  tapers 
near  cutoff. 

In  the  postbaking  technique  presented  here,  the  control  of  the 
postbaking  time  to  make  a single-mode  tapered  waveguide  is  not  as 
critical  as  that  of  the  diffusion  time  in  the  single  step  process  (Fig.  5- 
8),  resulting  in  better  controllability  and  assured  reproducibility.  The 
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Fig.  5-8.  Calculated  depths  of  index  profile  dx  as  a function  of  the 
diffusion  time.  dx  for  planar  waveguide  and  the  channel 
waveguides  for  WM=  3 and  8 fim  fabricated  in  the  same 
condition.  The  arrows  show  the  single  mode  regions  for 
the  channel  waveguides  at  X = 1.3  |im. 
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calculated  width  of  the  index  profile  in  postbaked  guides  versus 
temperature  is  shown  in  Fig.  5-9.  The  mask  opening  is  chosen  to  be 
3.5  pm  in  the  first  step  because  the  channel  guide  supports  fairly  well 
guided  mode  with  good  mode  circularity  and  low  loss.  The  postbaking 
time  is  5 hours.  Both  the  depth  and  width  increase  as  the  temperature 
increases.  The  experimental  values  before  and  after  postbaking,  shown 
as  open  rectangles,  are  in  excellent  agreement  with  the  calculated 
values  for  dy.  The  figure  shows  that  the  width  of  the  taper  is  not  a 
linear  function  in  the  longitudinal  direction,  although  the  temperature 
distribution  [Zhen89]  was  nearly  linear.  Since  the  total  number  of 
silver  ions  remains  almost  constant  as  a result  of  postbaking,  the 
maximum  index  of  the  waveguide  ns  at  the  surface  decreases  with 
increasing  temperature  (Fig.  5-10)  when  the  waveguide  size  becomes 
larger  (Fig.  5-9).  The  dark  circle  in  Fig.  5-10  denotes  ns  before 
postbaking.  The  normalized  frequency  V (Fig.  5-11),  which  equals 
ko dxV  2nbAnmax  , increases  slowly  with  temperature  compared  to  the 
change  of  widths  of  the  waveguides  (Fig.  5-9).  Again  the  dark  circle 
data  point  denotes  the  V value  before  postbaking.  The  normalized 
mode  index  b = (N2-ng)/(n§-ng)  (Fig.  5-12),  where  N = p/k 0 is  the  mode 
index  and  nb  is  the  bulk  substrate  index,  increases  slightly  at  lower 
temperatures  because  initially  the  mode  index  does  not  drop  as  much 
as  the  surface  index  and  also  the  size  of  the  waveguide  increases. 
However,  at  higher  temperatures,  b decreases  and  moves  closer  to 
cutoff  due  to  a larger  asymmetry  factor  resulting  from  the  lower 
surface  index,  even  though  V increases  (Fig.  5-11).  The  dependence  of 
b on  the  postbaking  temperature  is  useful  in  designing  devices  where 
phase  matching  is  critical,  such  as  directional  couplers,  grating 
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Fig.  5-9.  Width  dy  and  depth  dxof  the  index  profile  for  the  tapered  waveguide  vs. 

postbaking  temperature.  The  open  squares  show  the  data  measured  by 
electron  microprobe,  and  the  solid  lines  are  calculated  values. 
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Fig.  5-10.  Calculated  surface  index  ns  vs.  postbaking  temperature.  The  closed  circle  on  the 
vertical  axis  shows  the  calculated  data  point  prior  to  postbaking.  The  same 
symbols  are  used  for  the  values  before  postbaking  in  Figs.  5-1 1 - 13. 
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Fig.  5-11.  Normalized  frequency  V vs.  the  postbaking  temperature  for  the  tapered 

waveguide.  The  closed  circle  shows  the  calculated  data  before  postbaking. 
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Fig.  5-12.  Normalized  mode  index  b vs.  the  postbaking  temperature.  The  closed 
circle  shows  the  calculated  data  before  postbaking. 
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devices  etc.  The  horizontal  mode  size  Wh  and  vertical  mode  size  Wv 
shown  in  Fig.  5-13  increase  rather  rapidly  with  increasing 
temperature.  The  experimental  data  of  Fig.  5-13,  denoted  by  the  open 
squares,  were  reported  elsewhere  [Zhen89].  The  dark  circles 
represent  the  simulated  modal  widths  of  the  channel  waveguide  prior 
to  postbaking.  The  taper  was  sectioned,  and  the  modal  width  Wh  and 
depth  Wv  were  measured  successively  at  1 mm  intervals  over  a length 
of  5 mm  where  the  temperature  was  raised  from  315  to  385°C  during 
postbaking.  Figure  5-13  shows  that  the  difference  between  the 
numerical  simulation  and  experimental  measurement  is  small.  The 
discrepancy  may  result  from  the  anisotropic  diffusion  in  the  glass  and 
the  error  in  both  the  numerical  calculation  and  measurements. 

5.3.  Simulation  of  Propagation 
5.3.1.  Beam  Propagation  Analysis 

The  index  profile  of  the  tapered  waveguides  is  not  only  a 
function  of  x and  y as  in  straight  channels  but  is  also  a function  of  z 
along  the  propagation  direction,  which  complicates  the  modeling  and 
increases  the  computer  processing  time.  To  simplify  the  simulation,  it 
is  assumed  that  its  index  profile  in  the  lateral  or  y direction  can  be 
converted  from  a two-dimensional  index  profile  by  the  effective  index 
method  (EIM)  to  an  equivalent  1-D  profile  and  fitted  to  a gaussian 
function  to  simplify  the  calculation.  Although  this  conversion  is  not 
valid  for  the  modes  close  to  cutoff,  it  may  provide  us  some  useful 
information  in  designing  a taper  with  minimum  radiation  loss. 


□ Measured  data 
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Fig.  5-13.  Widths  of  the  mode  fields  as  functions  of  the  postbaking  temperature.  The 
solid  lines  show  the  simulation  results.  The  open  squares  denote  the  data 
obtained  by  near  field  measurement.  The  closed  circle  shows  the  calculated 
data  before  postbaking. 
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After  postbaking,  the  index  change  profile  although  broader  and 
weaker,  remains  gaussian  since  the  initial  condition  is  also  gaussian  for 
the  case  of  finite-source  diffusion  as  discussed  in  chapter  2.  The 
maximum  index  change  Anmax  becomes  smaller  and  the  half-width  W 
of  the  index  profile  at  1/e  point  in  the  transverse  direction  becomes 
inversely  proportional  to  Anmax  due  to  the  constant  number  of  the 

dopant  ions  during  postbaking.  Thus,  the  index  profile  can  be  written 
as 


n(y,z)  — ns  + Anmaxi 


Wx 

W(z) 


(5.2) 


where  ns  is  the  substrate  index,  Anmaxi  is  the  maximum  index  change 
at  the  beginning  of  the  taper  (z=0),  W(z)  is  the  half-width  of  the  index 
profile  at  the  1/e  point,  and  Wx  is  the  half- width  at  input  of  the  taper, 
i.e.,  z=0.  The  width  of  the  index  profile  can  be  controlled  by  varying 
both  the  diffusion  temperature  and  the  postbaking  time  as  mentioned 
earlier.  We  assume  that  W(z)  of  the  tapered  index  region  (0  < z < L)  can 
be  represented  by  one  of  the  following  functions: 


1)  linear: 


W(z)=W1  + (W2-Wi)  z/L 


2)  quadratic:  W(z)  = Vw?  + (W?  - W?)  z/L 


(5.3a) 

(5.3b) 


3)  raised-sine:  W(z)  = |(W2  + Wx)  + (W2  - Wx)  sin[g-(z  - ^)]j  /2 


(5.3c) 


where  L is  the  taper  length  and  W2  is  the  half-width  at  the  output  (z  = 
L). 
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The  simulation  of  wave  propagation  in  the  tapered  waveguides  is 
performed  by  the  beam-propagation  method.  As  described  earlier, 
BPM  is  a unified  model  for  simulating  the  propagating  wave  in  a 
graded-index  medium.  This  method  can  be  used  to  evaluate  the 
magnitude  as  well  as  the  phase  of  the  wave.  However,  in  the  simulation 
the  reflected  wave  is  assumed  to  be  negligible  and  the  field  is  assumed 
to  vanish  at  an  arbitrary  boundary  far  away  from  the  waveguide  region 
in  the  transverse  direction  to  avoid  the  error  due  to  the  periodic 
boundary  condition  imposed  by  the  fast  Fourier  transform. 

We  propagate  the  field  from  a straight  channel  waveguide 
section  of  length  Li  (=600  pm)  through  a tapered  transition  of  length 
L to  another  straight  section  of  length  Li . The  field  launched  into  the 
guide  is  obtained  by  solving  for  the  eigen-mode  of  the  first  section  by 
the  finite  difference  method  (FDM)  with  gaussian  index  profile.  The 
waveguide  is  sectioned  in  the  z-direction  and  the  propagating  field  at 
the  end  of  each  section  is  calculated  using  BPM.  For  evaluating  the 
overlap  integral,  the  mode  field  at  the  next  section  is  again  calculated 
using  FDM  and  assuming  that  the  index  n(y,z)  in  the  section  is 
invariant  in  the  z-direction  and  the  phase  front  of  the  mode  field  is  a 
plane  normal  to  the  z-direction.  At  the  end  of  each  section,  we 
calculate  how  much  power  remains  in  the  guided  mode  by  considering 
the  overlap  integral  r|  between  the  mode  field  and  the  propagating 
field  for  each  section  along  the  propagation  direction: 
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T](z)  = 


E(y,z)E6(y,z)dy 


i 


(5.4) 


E0(y,z)E6(y,z)dy  Ei(y,z)Et(y,z)dy 


where  Ex  is  the  initial  field  at  z = -Li,  Eq  is  the  fundamental  mode  field 
at  the  end  of  a given  section  z solved  by  FDM,  E is  the  complex 
propagating  field  solved  by  BPM,  and  the  stars  represent  the  complex 
conjugate  of  the  fields.  This  process  continues  along  the  z direction 
till  z = L + Li  is  reached. 

r\  equals  one  at  the  beginning  of  the  input  section,  i.e.,  at  z = -Lx 
and  decreases  due  to  the  radiation  in  the  transition  as  the  wave 
propagates.  A small  loss  is  also  encountered  at  the  boundary  in  the 
transverse  direction  where  the  field  is  assumed  to  vanish.  The  total 

radiation  loss  is  evaluated  from  the  successive  overlap  integrals  from 
z=  -Lx  to  z = L + Lx. 

5.3.2.  Simulation  Results 

Since  BPM  can  track  the  guided  modes  as  well  as  the  radiation 
modes,  it  can  be  used  to  simulate  abrupt  junction  transitions,  i.e.,  the 
length  of  the  taper  transition  L = 0.  The  index  profile  is  expressed  by 
Eq.  (5.2)  with  Wx  = 2.5  pm  at  input  and  W2  = 5.0  pm  in  the  second 
section  of  the  waveguide.  The  amplitude  of  the  field  along  the 
waveguide  for  z=0  to  Lx  is  shown  in  Fig.  5-14,  where  the  taper  length 
is  zero.  The  mismatch  in  the  mode  sizes  at  the  junction  (z=0)  causes 
part  of  the  energy  of  guided  wave  to  be  coupled  into  the  radiation 


Fig.  5-14.  Amplitude  of  a field  propagating  through  an  abrupt  junction.  The  calculation  width 
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mode  in  the  second  section  as  may  be  seen  from  the  figure.  Some 
ripples  of  the  field  appear  at  the  position  close  to  z = Li  = 600  pm 
indicating  that  the  radiation  may  bounce  back  from  the  boundaries  of 
the  calculation.  This  imposes  a limitation  on  the  calculation  accuracy. 
The  computation  width  in  this  simulation  is  128  pm.  The  width  can  be 
increased  to  avoid  the  above  problem.  However,  the  resolution  of  the 
index  profile  will  be  reduced  if  the  same  number  of  points  is  used.  The 
overlap  integral  r|  in  this  case  equals  0.9588  equivalent  to  0.183  dB 
loss.  Figure  5-15  shows  the  evolution  of  the  wave  propagating  in  a 
raised-sine-profile  taper  with  L = 500  pm.  The  half-width  of  the  index 
profile  increases  from  2.5  pm  at  z = 0 to  5 pm  at  z = L.  The  straight 
section  with  length  of  Li  following  the  taper  is  shown  in  the  figure  but 
the  preceding  straight  section  (z  < 0)  is  ignored.  The  computation 
width  is  the  same  as  Fig.  5-14  and  is  considerably  larger  than  the 
mode  size  of  the  waveguide.  The  power  loss  due  to  radiation  in  the 
taper  is  small  compared  to  the  case  in  Fig.  5-14  and  the  radiation  field 
can  not  be  clearly  seen  from  the  figure.  Figure  5-16  shows  the 
normalized  power  in  the  guided  mode  along  the  propagation  direction 
for  linear  (dashed  curve)  and  raised-sine  (solid  curve)  profile.  The 
dotted  line  represents  the  half-width  of  raised-sine-curve  index 
profile  and  the  tapered  region  is  from  z=0  to  z=500  pm.  The  linear 
and  parabolic  tapered  index  profile  are  not  shown  in  the  figure.  The 
loss  is  higher  for  the  linear  profile  than  for  the  raised -sine  profile  in 
the  beginning  of  the  taper  but  it  becomes  lower  at  the  end  since  the 
maximum  slope  dn[y,z)/dz  is  larger  for  the  sine  function.  The  losses 
are  0.032  dB  and  0.056  dB  for  the  linear  and  raised-sine  tapers, 
respectively,  which  are  much  smaller  than  the  loss  of  abrupt  junction. 
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Fig.  5-15.  The  evolution  of  the  propagating  wave  in  the  raised-sine  curve  taper.  The  tapered  region 
is  between  z=0  and  L,  where  L = 500  pm.  The  length  of  the  straight  region  is  600  pm. 
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Fig.  5-16.  The  power  in  the  guided  mode  along  the  propagation  direction  for  linear  (dashed 
curve)  and  raised-sine  (solid  curve)  profile.  The  dotted  line  represented  the  width 
of  raised-sine-curve  index  profile  and  the  tapered  region  is  from  z = 0 to  500  pm. 
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As  a result,  tapered  transitions  always  help  reduce  the  mismatch  loss. 
The  overlap  integral  r\  of  the  field  at  z=0  almost  equals  unity  indicating 
the  accuracy  of  FDM  in  calculation  of  the  mode  field  as  well  as  BPM  in 
modeling  the  wave  propagtion  since  the  eigen-mode  must  remain 
unchanged  and  the  power  should  be  constant  in  the  straight  channel 
region.  Figure  5-17  shows  the  guided  power  versus  the  length  of  the 
taper  for  both  the  raised -sine  (dashed  line)  and  linear  (solid  line) 
profiles.  The  resulting  power  in  the  guided  mode  for  a parabolic 
profile  (not  shown)  is  slightly  smaller  than  that  for  the  linear  profile. 
As  seen  from  Fig.  5-17,  the  loss  at  the  discontinuity  of  the  index 
gradient  in  the  z-direction  between  the  taper  and  the  input  straight 
channel  waveguide  becomes  dominant  as  the  taper  length  L increases. 
As  a result,  the  raised-sine  profile  suffers  less  loss  than  both  the  linear 
and  parabolic  profiles  for  L > 1200  pm.  For  a raised-sine  profile  taper 
longer  than  2 mm,  the  loss  is  negligible  (Fig.  5-17).  In  the 
experiments  [Zhen89],  the  radiation  loss  is  smaller  than  the 
propagation  loss  and  is  below  the  measurement  limit. 

5.4.  Summary 

Channel  and  tapered  waveguides  have  been  simulated  in  this 
chapter  with  the  parameters  obtained  from  the  planar  waveguide 
modeling.  The  simulation  results  are  in  excellent  agreement  with  the 
experimental  measurements  of  the  mode  size  as  well  as  index  profile 
width  along  the  taper.  The  modeling  of  wave  propagation  is  performed 

by  BPM,  which  shows  that  the  radiation  loss  is  negligible  for  a 5 mm 
long  taper. 
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Fig.  5-17.  Power  output  vs.  taper  length  for  the  linear  and  raised-sine  taper  profiles. 


CHAPTER  6 

DIRECTIONAL  COUPLERS  AS  WAVELENGTH 
MULTIPLEXERS/DEMULTIPLEXERS 

6.1.  Introduction 

There  are  several  devices  which  use  the  directional  coupler 
configuration  to  perform  such  roles  as  optical  modulators  and  switches 
[Thyl88,  Koai86,  Koge76],  wavelength  demultiplexers  [Verb88, 
Suzu89,  Chen90b,  Chen90e],  power  dividers  [Yip84].  As  such 
directional  couplers  have  very  wide  applications  in  the  optical 
communication  systems.  The  guided -wave  wavelength  division 
multiplexers/demultiplexers  provide  opportunities  for  increasing  the 
transmission  capacity  of  coherent  and  other  fiber-optic  systems 
[Glan90].  The  wavelength  demultiplexers  have  been  realized  by  using 
symmetric  directional  couplers  consisting  of  two  identical  straight 
channel  waveguides  [Imot87,  Suzu89,  Chen90e]  and  asymmetric 
directional  couplers  with  non-identical  guides  [Alfe78,  Brob86]. 

In  the  previous  chapter  we  presented  a systematic  modeling  of 
index  profiles  and  mode  fields  for  channel  and  tapered  waveguides.  In 
this  chapter,  we  extend  these  techniques  to  model  symmetric 
directional  couplers  in  graded-index  waveguides  as  dual  wavelength 
multiplexers/demultiplexers  and  verify  the  results  by  fabricating  and 
characterizing  these  devices  by  Ag+-Na+  exchange  in  BK7  glass. 

The  coupled-mode  theory  is  used  for  predicting  the 
characteristics  of  the  symmetric  directional  couplers  and  the  effective 
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index  method  is  employed  to  solve  the  modal  properties  of  both  the 
channel  waveguides  and  the  symmetric  directional  coupler.  Modeling 
results  are  used  to  fabricate  demultiplexers  operating  at  1.32/1.56  pm. 
We  achieved  a demultiplexer  with  a propagation  loss  of  0.15  dB/cm 
and  a total  insertion  loss  of  1.25  dB  for  a 14.5  mm  device  with 
crosstalk  of  -30  dB  and  -32  dB  at  1.315  and  1.561  pm,  respectively.  In 
each  case,  the  modeling  results  were  used  to  optimize  the  device 
performance. 


6.2.  Coupled-Mode  Theory 

When  two  individual  waveguides  are  brought  close  to  each  other, 
the  guided  modes  in  the  individual  waveguides  exhibit  coupled  mode 
behavior;  the  propagation  characteristics  of  the  coupled-system,  e.g., 
mode  field  and/or  mode  index,  are  altered.  When  the  coupling  is 
“weak”,  the  coupled-mode  theory  accurately  predicts  the  behavior  of 
the  coupled  waves  as  well  as  the  transfer  of  power  between  the 
waveguides  along  the  propagation  direction. 

The  coupled-mode  theory  was  presented  by  Miller  [Mill54]  in 
1954  for  two  coupled  transmission  lines.  The  theory  was  applied  to 
optical  waveguide  to  describe  guided-wave  nonlinear  optical 
interactions,  electro-optic  switching  etc.  by  Yariv  [Yari73].  The 
coupled-mode  theory  discussed  by  Miller  [Mill54]  and  Yariv  [Yari73] 
can  be  applied  only  to  the  weakly  coupled  waveguides  i.e.,  when  the 
separation  between  the  waveguides  is  relatively  large.  Suematsu  and 
Kishino  [Suem77]  treated  the  case  of  two  strongly  coupled  waveguides 
and  showed  that  the  degree  of  power  transfer  between  the  waveguides 
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depends  strongly  on  the  coupling  strength.  As  the  coupling  becomes 
larger,  less  power  is  transferred.  Recently,  a set  of  improved  or 
vectorial  coupled-mode  theories  [Marc86a,  Chua87a,  Chua87b, 
Haus87a,  Stre87a,  Stre87b,  Vass87,  Vass88,  Haus89]  were  reported 
and  experimentally  verified  [Marc86b].  Most  of  them  are  applicable  in 
the  case  of  strong  coupling.  However,  we  will  consider  only  the  case  of 
weak  coupling,  since  the  scalar  coupled-mode  theory,  applicable  in 
this  case,  is  easy  to  use  and  gives  sufficiently  accurate  results. 

6.2.1.  Coupled-Mode  Theory 

Suppose  we  have  two  weakly  coupled  waveguides.  The  fields  in 
the  waveguides  propagating  in  the  z-direction  in  the  composite 
structure  can  be  expressed  as 

Fi(y,  z)  = ai(z)  ffy)  exp(-jpiZ)  , (6.1) 

where  the  subscripts  i (=1  or  2)  represent  one  of  the  waveguides;  a(z) 
is  the  complex  amplitude  of  the  field;  My)  is  the  mode  field  of  the 
waveguide  i with  the  propagation  constant  pi  and  y is  the  transverse 
direction.  The  fields  are  normalized  such  that  the  power  carried  by 
each  waves  are  given  by  ai  af  (i=l  or  2),  where  the  superscript  * 
represents  the  complex  conjugate.  In  the  uncoupled  case,  the  guided 
modes  must  maintain  the  same  power  in  the  waveguides;  afs  are  z- 
independent  constants.  Otherwise,  the  z-dependent  amplitudes  (ai 
and  a.2 ) obey  the  following  relations  [Yari73]: 
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= -jK2ia2  exp  (-2j  5z) 

(6.2) 

and  ^2-  = -jKi2ai  exp  (+2j  5 z)  t 

Cl  z 

where  5 = (P2-Pi)/2  and  K12  and  k2i  are  the  coupling  coefficients,  given 
by  [Lee86] 


- Vn i (y) ) t/2/1)  dy 


and 


= 2/ 


Ki2  = ^ (Vn(y)  -Vn2(y) ) (/i/2)  dy  , 


(6.3) 


where  co  is  the  angular  frequency,  n(y)  is  the  index  of  the  entire 
(composite)  waveguide  system  and  ni(y)  are  the  index  of  the 
uncoupled  waveguides.  In  general,  ki2  is  not  equal  to  k2i  [Hard85, 
Marc86a,  Marc86b].  Nevertheless,  when  the  guides  are  close  to 
synchronism  (5  = 0)  and/or  far  apart  (weak  coupling),  the  relation  ki2  = 
K2i  is  reasonably  satisfied.  For  the  boundary  conditions:  ai(z=0)  = Ao 
and  a2(z=0)  = 0,  the  solutions  of  Eq.  (6.2)  are  given  by  [Lee86,  Koge88] 


adz)  = Ao 


cos(Kz)  +J^sin(Kz) 

K 


exp(-j5z) 


a2(z)  = Aq 


-J'K12 

K 


sin(Kz) 


exp(+j5z) 


(6.4) 


where  Ao  is  a constant,  K = ^5 2+ki2k2i  and  5 = (p2-pi)/2.  In  the  case  of 
a coupler  with  two  identical  waveguides,  5 = 0, 
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Kl2  = K2l  = K , 


(6.5) 


and  K = k.  Therefore,  the  complex  amplitudes  are  given  by 


ai(z)  = Aq  COS(kz) 


a2(z)  = -jAo  sin(Kz)  , 


(6.6) 


Notice  that  the  fields  in  the  individual  waveguides  have  a n/ 2 phase 
difference.  The  normalized  powers  in  the  individual  waveguides  are 
expressed  as 


Thus,  the  power  propagating  along  a symmetric  coupler  is  transferred 
back  and  forth  sinusoidally  between  the  two  waveguides.  The 
normalized  power  is  unity  in  waveguide  1 and  zero  in  waveguide  2 at  z 
= 0 and  power  transfers  completely  to  waveguide  2 at  z = tc/ (2k).  The 
interaction  length  required  to  have  complete  power  transfer  is 
defined  as  the  coupling  length  lc  = tc/(2k).  Equation  (6.6)  can  be 
rewritten  in  terms  of  lc  as 


Pi  = — ^ = COS2(kz) 


a3 


p2  = a2^2  = sin2(Kz) 


(6.7) 


A3 


(6.8) 


3 dB  power  divider  is  achievable  for  the  interaction  L = ^ 

z 

this  case  the  power  is  equally  split  at  the  output. 


K 

or  — .In 
4k 
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Since  the  individual  waveguides  are  identical,  the  phase 
matching  condition  is  always  satisfied  at  all  wavelengths  for  the 
symmetric  directional  coupler.  However,  this  does  not  mean  the 
bandwidth  for  complete  power  transfer  is  infinite.  The  limitation  on 
bandwidth  is  imposed  by  the  wavelength-dependent  coupling 
coefficient  and  coupling  length  as  will  be  discussed  later.  As  a result, 
waves  of  different  wavelengths  are  split  at  the  output  of  the  directional 
coupler. 

6.2.2.  Relation  between  Normal-Modes  and  Counled -Modes 

Coupled-mode  theory  solves  the  coupled  system  in  terms  of  the 
eigen-modes  of  individual  waveguides  and  the  coupling  coefficients 
between  them.  However,  it  is  valid  only  in  the  case  of  weak  coupling. 
On  the  other  hand,  the  normal-mode  approach  solves  the  eigen-modes 
of  the  entire  waveguide  system.  Therefore,  this  approach  is  applicable 
for  the  cases  of  arbitrary  coupling.  If  the  waveguides  are  weakly 
coupled,  the  eigen-solutions  of  the  normal  modes  can  be  expressed  in 
terms  of  that  of  the  coupled-mode  and  the  coupling  coefficients. 

Consider  a symmtric  directional  coupler  consisting  of  two 
single-mode  waveguides.  We  define  a normal-mode  field  F°  as  a linear 
combination  of  the  uncoupled  mode  fields  as  follows: 

F°(y,z)  = A\[z)f  i(y)e'JPlZ  + A2(z)f2{y)e-W*z  , (6.9) 

where  Ai  and  A2  are  functions  of  z.  Since  F°  is  the  normal-mode  field, 
it  follows  the  relation 

F°(y,z)  = A/°(y)e-JP°z 

* 


(6.10) 
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i.e., 


dEl  = .jgo  po 

dz 


(6.11) 


where  A is  a constant,  f°( y)  is  the  transverse  normal-mode  field,  and 
P°  is  the  propagation  constant  corresponding  to  that  normal  mode. 
Substituting  Eq.  (6.9)  into  (6.11),  one  obtains 

fl  _jPA1+jp°Ai)  + f2  (^-  -jpA2+jp°A2)  = 0 . (6.12) 


where  we  have  assumed  a symmetric  directional  coupler,  i.e.,  p = p!  = 
P2.  In  the  above  equation,  only  fi  and  fi  are  functions  of  y.  Therefore, 
for  nonzero  / 1 and  /2,  Eq.  (6.12)  can  be  written  as  two  separate 
equations: 


-jPA,+jP°A,  = 0 
^ -jpA2+jp°A2  = 0 . 


Applying  the  coupled-mode  equation  Eq.  (6.2)  for  Ai  and  A2  in  this 
case,  Eq.  (6.13)  can  be  written  as 


-jKAa-jpAi+jf^A!  = 0 
-j  kAi  -j  PA2+j  (3°A2  = 0 


(-P+P°)Ai-kA2  = 0 
-kA1+(-3+|30)A2  = 0 . 


(6.14) 


For  nontrivial  solutions  for  A^z)  and  A2(z), 


-P+P°  -K 
-K  -p+p° 


i.e.. 


(6.15) 


P°  = P±K, 


(6.16) 
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which  relates  the  propagation  constants  of  the  two  normal  modes  to 
those  of  the  individual  waveguide  modes.  The  propagation  constants  of 
the  fundamental  (even)  and  the  first-higher  (odd)  normal  modes  are 
thus  given  by 


and 


(3+  = p + K 

P'  = P " K . 


(6.17) 


respectively.  When  the  separation  of  the  waveguides  is  large,  the 
coupling  is  weak  and  p+  and  p"  are  close  to  the  propagation  constant  of 
uncoupled  mode  (p).  As  the  separation  decreases,  the  coupling 
becomes  stronger  and,  as  a result,  p+  increases  and  p'  decreases.  It 
must  be  pointed  out  that  Eq.  (6.17)  is  not  applicable  in  the  strong 
coupling  case  since  in  this  case,  the  normal  mode  description  given  by 
Eq.  (6.9)  is  not  valid.  Note  that  the  coupling  coefficient  k=  (p+-P')/2. 
For  the  symmetric  directional  coupler,  Eq.  (6.6)  can,  thus,  be 
expressed  as 


ai(z)  = Ao  COS 
a2(z)  = -jAo  sin 


(6.18) 


The  relations  between  Ai  and  A2  are  obtained  by  substituting  Eq. 
(6.16)  into  Eq.  (6.14)  as 


A:(z)  = A2(z) 
and  Ax(z)  = -A2(z) 


(6.19) 


Applying  Eq.  (6.2)  again,  we  have 


for  p+  = P+k, 
for  P'  = p-K  . 
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dAi 

dz 

dA] 

dz 


= -jxAi 
= jxAi 


which  have  solutions 


for  p+ 
fcrp' , 


(6.20) 


Ai(z)  = Ao  exp(-jKz)  for  p+ 

Ai(z)  =Aoexp(jKz)  forp',  (6'21) 

where  Ao  is  constant.  The  normal  mode  fields  for  the  fundamental  and 

first  higher  modes  are  obtained  by  substituting  Eq.  (6.21)  into  Eq. 
(6.9)  as 


F+(y,z)  - Ao  [/i(y)+/2(y)J  e^*z  s Ao  /+(y)  e^'2  (6.22a) 

and  F-(y,z)  = Ao  [/,(y)-/2(y)]  e * s Ao  / ( y)  e**  , (6.22b) 

respectively.  Equation  (6.22a)  depicts  that  the  fundamental  normal 
mode  field  if*)  of  the  symmetric  directional  coupler  can  be  expressed 
as  an  in-phase  superposition  of  the  two  individual  mode  fields  /j  and 
/2  with  equal  amplitudes.  Similarly,  Eq.  (6.22b)  depicts  that  the  first 

order  normal  mode  field  CT)  is  an  out  of  phase  superposition  of  the 
two  individual  mode  fields. 

Figure  6-1  shows  a schematic  of  a directional  coupler  consisting 
of  two  identical  straight  channel  waveguides,  where  S is  the  separation 
between  two  channel  waveguides  measured  from  the  center  of  one 
waveguide  to  that  of  the  other  and  WM  is  the  width  of  the  mask 
opening.  The  mode  fields  are  shown  in  the  figure  marked  as  p,  p+  and 
P . representing  the  fundamental  mode  of  the  individual  waveguides. 
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c/} 


Fig.  6-1.  Uncoupled  and  normal  modes  of  a symmetrical  directional  coupler. 
Notations  are  defined  in  the  text. 
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and  the  fundamental  and  first  order  mode  of  the  directional  coupler, 
respectively.  It  is  obvious  that  /*( y)  is  the  summation  of  the  individual 
mode  fields  / 1 (y)  and  /2(y);  / (y)  is  the  difference  of  /i(y)  and  /2(y). 
Conversely,  in  the  case  of  weak  coupling,  the  mode  fields  of  the 
individual  waveguides  can  also  be  written  as  a summation  or  difference 
of  two  normal  modes  of  the  entire  structure  by  using  Eq.  (6.22)  and 
(6.10).  Substituting  V or  into  the  superscript  ‘o’  in  Eq.  (6.10),  we 
obtain  /i(y)  = ^ (/+(y)+/'(y))  and  /2(y)  = ^ (/+(y)-/"(y)).  In  other 

words,  the  wave  input  from  one  of  the  waveguides  can  be  decomposed 
into  two  normal  modes  with  equal  powers.  These  two  normal  modes, 
having  different  phase  velocities  or  propagation  constants,  interfere 
with  each  other  as  they  propagate  along  the  directional  coupler.  If  the 
total  energy  launched  into  one  of  the  guides  is  coupled  equally  to  these 
two  normal  modes,  the  energy  transfers  to  the  other  channel  at  a 
distance  where  the  difference  in  phases  of  the  two  normal  modes  is 
(2m-l)7i,  where  m is  a positive  integer.  If  the  phase  difference  is  2mrc, 
the  energy  shifts  back  to  the  original  channel.  The  coupling  length  lc 
is  the  length  required  to  achieve  a relative  phase  shift  n between  two 
normal  modes  propagating  along  the  directional  coupler  and  is  given 
by  lc  = 7t/(P+-p')  = 7i/(2k).  Therefore,  if  the  interaction  length  (L)  of  the 
directional  coupler  equals  an  odd  number  of  coupling  lengths,  the 
superposition  of  the  normal  modes  yields  complete  energy  transfer 
Similarly,  power  transfer  back  to  the  original  waveguide  occurs  when  L 
equals  an  even  number  of  coupling  lengths.  We  do  not  consider  the 
case  of  strong  coupling  since  complete  power  transfer  between  the 
two  identical  waveguides  of  the  coupler  is  not  possible  [Suem77J. 
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6.3.  Modeling 

In  this  section  we  present  and  discuss  the  results  of  the 
modeling  of  the  index  profile,  and  design  considerations  for 
directional  coupler  as  a wavelength  demultiplexer.  The  results  are 
verified  by  fabrication  and  characterization  of  symmetric  directional- 
coupler  devices  as  described  later. 

6.3.1.  Index  Profile 

It  is  well  known  that  in  graded  index  waveguides,  the  modal 
profile  is  somewhat  insensitive  to  the  index  profile  [Rama83]. 
However,  the  index  profile  affects  the  dispersion  relation  as  well  as 
the  propagation  constant.  A knowledge  of  the  index  profile  prior  to 
the  simulation  of  waveguides  is  of  paramount  importance.  In  the  case 
of  silver  ion-exchanged  waveguides  in  BK7  glass,  the  simulation, 
presented  in  the  last  chapter,  shows  a gaussian-like  index  profile. 
Thus,  we  assume  that  the  index  profile  of  the  channel  is  a semi- 
infinite gaussian  function  in  the  depth  direction.  The  assumption  of  a 
gaussian  index  profile  in  our  model  is  also  applicable  to  some  other 

cases,  for  example,  K+-Na+  exchanged  waveguides  in  soda-lime  glass 
[Mili89J. 

Since  the  diffusion  is  isotropic  in  the  width  (horizontal) 
direction  as  well  as  in  the  depth  (vertical)  direction,  the  index  profile 
in  the  horizontal  direction  is  also  assumed  to  be  Gaussian  in  shape.  In 
the  case  of  channel  waveguides,  this  is  especially  true  if  the  dimension 
of  the  diffused  profile  in  the  horizontal  direction  is  much  larger  than 
the  width  of  the  mask  opening.  Hence,  the  contours  of  equal 
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concentrations  are  semicircles.  The  assumption  is  reasonable  in  the 
case  of  Ag+-Na+  exchange  in  BK7  [Chen90a]  also,  as  evidenced  by  the 
excellent  correlation  between  the  theoretical  and  experimental  results 
presented  later.  Based  on  the  above  assumptions,  the  equation  for  the 
2-D  index  profile  is  given  by 


where  x = 0 represents  the  surface,  X is  the  wavelength  in  vacuum, 
nb(^)  is  the  wavelength-dependent  substrate  index,  Anmax  is  the 
maximum  index  change  of  the  guide  and  W is  the  half-width  of  the 
channel  waveguide  at  the  1/e  point.  In  the  simulation,  the  dispersion 
or  the  wavelength-dependent  index  of  the  substrate  glass  is  taken  into 
account.  However,  the  maximum  index  change  Anmax  is  assumed  to  be 
constant  in  the  operating  wavelength  region.  This  assumption  is 
acceptable  as  long  as  no  absorption  occurs  in  the  wavelength  region  of 
interest  (1.2- 1.6  pm),  due  to  the  incorporation  of  the  diffusing  ions, 
thereby  causing  a resonant  behavior  of  the  refractive  index. 

The  directional  coupler  consists  of  two  identical  channel 
waveguides  with  separation  S,  as  shown  in  Fig.  6-1.  Therefore,  the 
index  profile  of  this  structure  is  a summation  of  index  profiles  of  the 
individual  straight  channels  given  by 


forx>0  (6.23) 


(6.24) 
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Such  a superposition  of  individual  waveguide  index  profiles  has  been 
verified  in  Ti  indiffused  LiNbC>3  directional  coupler  [Noda81]  and 
validated  the  use  of  the  above  index  profile  in  the  modeling  of  ion- 
exchanged  directional  couplers  presented  in  this  chapter. 

6.3.2.  Numerical  Method 

The  effective  index  method  (EIM)  along  with  the  1-D  matrix 
method  are  used  to  solve  the  mode  indices  in  the  modeling  of 
directional  couplers.  The  effective  index  method  and  matrix  method 
were  described  in  chapter  3 for  graded-index  channel  waveguides. 
The  effective  index  method  is  used  to  convert  a 2-D  index  profile  into 
its  1-D  equivalent  index  profile,  while  the  matrix  method  is  used  to 
solve  mode  indices  of  the  equivalent  profile.  In  the  case  of  directional 
couplers,  the  same  approach  can  be  employed.  We  first  calculate  the 
mode  index  of  the  individual  channel  waveguide  and  next  evaluate  the 
normal  mode  indices  of  the  directional  coupler.  As  addressed  earlier, 
the  mode  index  of  the  fundamental  normal  mode  of  the  coupler  is 
slightly  larger  than  the  mode  index  of  the  individual  waveguides  and 
that  of  the  first  order  normal  mode  is  slightly  smaller.  The  information 
obtained  from  the  evaluation  of  the  mode  index  of  the  channel  is  useful 
to  estimate  the  mode  indices  of  the  directional  coupler  since  the 
initial  guess  is  reasonably  close.  Therefore,  considerable  CPU  time  can 
be  saved  in  the  modeling. 
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6.3.3.  Directional  Coupler 

The  coupling  length  lc  of  the  directional  coupler  equals  7i/((3+-p') 
or  0.5X/(N+-N  ),  where  N+  and  N are  the  normal  mode  indices  of  the 
fundamental  and  first  higher  modes  of  the  directional  coupler, 
respectively.  The  coupling  length  is  evaluated  by  the  normal-mode 
approach  [lc=rc/(P+-P  )]  instead  of  the  coupled-mode  approach 
[1c=k/(2k)]  with  k evaluated  by  Eq.  (6.3)  since  the  latter  involves  an 
overlap  integral  of  the  individual  mode  fields  which  requires  more 
CPU  time. 

The  normalized  power  outputs  Pi  and  P2  defined  in  Eq.  (6.7), 
assuming  power  is  launched  into  channel  1 at  z = 0,  can  be  rewritten 
in  terms  of  the  coupling  length  as 


respectively,  where  z = L is  the  interaction  length.  Note  that  Pi  and  P2 
as  well  as  the  coupling  length  are  functions  of  wavelength  since  the 
coupling  is  wavelength-dependent.  If  the  length  L is  not  exactly  equal 
to  (2m- 1)  lc,  the  power  is  not  quite  zero  at  port  1.  This  cross  power 
ratio  represents  the  nonzero  output  from  port  1,  normalized  to  P2. 

From  the  above  equations,  the  cross-power  ratio  in  dB  can  be  shown  to 
be 


where  AL  - L-(2m-l)  lc,  is  the  small  deviation  from  the  exact  length 
required  to  accomplish  complete  transfer  of  power.  For  example,  for  a 


PlM  = cos2(-fl  L ) 

21cft) 


(6.25) 


and 


P2ft)  =sin2(^i_) 

21c(?tr  • 


(6.26) 


189 


coupling  length  of  5.4  mm,  to  achieve  a cross  power  ratio  of  -30  dB 
and  -40  dB,  the  fabrication  error  in  the  device  length  must  be  within 
±109  pm  and  ±34  pm,  respectively.  Similarly,  for  L close  to  an  even 
number  of  coupling  length,  the  cross  power  ration  at  port  2 is 


10  log  £2  = 10  log 


?2 

Pi 


= 10  log 


flic  ALl 

V 21c  I 


(6.27) 


where  AL  = L-2m  lc.  Note  that  in  Eqs.  (6.26)  and  (6.27),  although  AL  is 
the  same,  the  associated  L’s  are  different. 

The  spectral  bandwidth  at  the  operating  wavelength  lQ,  derived 
from  Eq.  (6.25)  by  the  perturbation  theory,  for  a given  cross  power 
ratio  (caused  by  the  deviation  in  wavelength)  at  port  i,  is 


2AX  = 


(dkji 


m=l,2,...  (6.28) 


For  example,  in  the  specific  case  considered  in  the  following  section, 

with  A,0  = 1.3  pm,  L equals  to  5.4  mm  (one  coupling  length),  and 

assuming  a reasonable  value  for  the  derivative,  — = -0.022  mm/nm  (as 

dA, 

seen  from  Fig.  6-10  described  later),  the  spectral  bandwidth  for  a 
specified  cross  power  ratio  of  -20  dB  and  -30  dB  into  port  1,  must  be 
within  31  nm  and  9.9  nm,  respectively.  A larger  bandwidth  can  be 
obtained  by  designing  the  device  with  minimum  m as  well  as  longer  lc 
and  smaller  . However,  for  a fixed  m,  the  bandwidth  at  each 

channel  is  almost  independent  of  the  two  center  wavelengths  of  the 
demultiplexer  and  increases  with  increasing  separation  of  the  two 
wavelengths.  For  asymmetric  directional  couplers  used  as  wavelength 
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demultiplexers,  the  bandwidth  required  to  achieve  a given 
transmission  level  (-20  or  -30  dB)  is  much  smaller  [Marc87]. 

6.3.4.  Design  of  Dual-Wavelength  Demultiplexers 

As  discussed  earlier,  the  coupling  length  lc  is  wavelength- 
dependent.  At  longer  wavelengths  lc  decreases  due  to  stronger 
coupling  between  the  modal  fields.  For  a given  interaction  length  L, 
the  wavelength-dependent  behavior  of  lc  results  in  different  amounts 
of  power  being  transferred  at  different  wavelengths  to  the  output 
ports.  As  shown  in  Fig.  6-2,  for  dual  wavelength  demultiplexing,  we 
require  complete  power  transfer  to  port  2 at  wavelength  and  to 
port  1 at  wavelength  In  general,  the  number  of  coupling  lengths  in 
the  region  of  interaction  must  be  an  odd  number  for  A,i  and  an  even 
number  for  \2.  In  other  words,  it  is  required  that  (2mi-l)  lcfti)  = 2m2 
Ic(^2).  where  mj  and  m2  are  positive  integers. 

In  designing  the  demultiplexer,  we  define  a parameter  R as  the 
ratio  of  the  normal  mode  AN’s  at  the  two  operating  wavelengths.  From 
the  relation  in  the  last  section,  the  ratio,  for  splitting  1.3  and  1.55  pm, 
is 


p = AN(1.55  pm)  1.55  1J  1.3  pm) 

AN(1.3  pm)  " 1.3  lc(l. 55  pm)  • (6  29) 

Notice  that  lc(1.3gm)  is  always  greater  than  lc(1.55pm).  Hence,  R is 
larger  than  1.19.  Many  values  of  R can  meet  the  requirement  of  a 
1.3/1.55  pm  wavelength  demultiplexer  as  shown  in  Table  2,  where 
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Fig.  6-2.  The  operating  principle  of  DEMUX,  (a)  Wave  input  from  WG1  is 
decomposed  into  two  normal  modes,  (b)  The  energy  transfers  to 

WG2  for  one  coupling  length  at  (c)  The  energy  transfers  back 
to  WG 1 for  two  coupling  lengths  at 
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Table  2 

Ratio  of  coupling  length  vs.  R 


lc(  1 .3pm)  :lc(  1 . 55pm) 

R 

2:1* 

2.385 

3:2 

1.788 

4:3* 

1.590 

4:1* 

4.769 

5:4 

1.490 

5:2 

2.981 

* 1.3pm  wavelength  output  at  port  2, 
otherwise  at  port  1 
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only  six  of  the  many  possible  pairs  are  listed.  To  obtain  complete 
transfer  of  power  in  the  case  of  weak  coupling,  usually  lc  (and,  hence, 

L)  is  rather  long;  this  results  in  higher  propagation  loss  and  is, 
therefore,  not  practical.  The  minimum  length  of  the  device  operating 
at  1.3/1.55  pm  is  achieved  when  the  interaction  length  L equals  one 
coupling  length  at  1.3  pm  and  two  coupling  lengths  at  1.55  pm  so  that 
the  ratio  R equals  2.385. 

If  R deviates  from  the  optimized  value  in  the  fabrication,  the 
performance  of  the  wavelength  demultiplexer  will  be  degraded  and, 
thus,  the  output  port  1 or  port  2 may  have  energy  at  both  wavelengths. 
The  crosstalk  ratios  for  port  1 are 


_ Pi(1.3pm) 
Pi(1.55  pm) 

(6.30) 

and  for  port  2 

_ P2(1.55  pm) 
P2(1.3  nm)  • 

(6.31) 

where  Pi  and  P2  are  expressed  in  Eq.  (6.25).  Predicting  by  the 
coupled-mode  theory,  both  xi  and  X2  are  zero  for  perfect 
demultiplexing  at  1.3  and  1.55  pm.  However,  fabrication  error  is 
inevitable,  causing  the  ratio  R to  depart  from  the  optimized  value. 
Three  cases  are  considered  for  the  estimation  of  the  crosstalk  ratio  x 
in  terms  of  AR  and  AL,  where  it  is  assumed  that  R = R0+AR  with  AR 
being  small  compared  to  the  desired  Rq(=2.385)  and  AL  = L - 
lCo(l-3pm)  or  L-  2 lco(  1 .55pm).  The  subscript  ‘o’  represents  the 
reference  values  for  the  demultiplexer  in  question,  with  no  deviation. 
The  same  notation  is  used  in  the  following  discussion  of  a 
demultiplexer  operating  at  the  wavelength  1.3/1.55  pm.  We  have 
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omitted  the  units  of  the  wavelength,  viz.,  pm,  for  the  sake  of 
convenience. 

Case  1:  L = 2 lc(1.55)  = 2 lp0(1.55)  and  lc(  1 .3)  = 1C0(1.3)+A1C,  where 
Alc  is  rather  small  compared  to  lco(1.3).  In  this  case,  X2  = 0 since 
L equals  exactly  twice  the  coupling  length  at  1.55  pm.  However, 
Xl  in  Eq.  (6.30)  is  not  zero  and  is  approximately  equal  to  Pi (1.3). 
From  Eq.  (6.25),  we  find 


Xi  = cos' 


K 


2 + 


AL 


\ led- 55)/ 


(6.32) 


With  some  mathematical  manipulation  from  Eq.  (6.29),  the 
deviation  AR  can  be  expressed  as 


AR  = Ro 


AIq 

lco(1.3) 


= Ro 


Alo 

2 lpod.55)  • 


(6.33) 


From  the  above  two  equations  eliminating  Alc,  the  crosstalk  ratio 
Xi  is  obtained,  for  small  deviation  AR,  in  terms  of  the  ratio  of  ^ 


Xi 


7T2/ar|2 

4(Rj  • 


(6.34) 


According  to  the  above  equation,  the  change  in  R must  be  kept 
within  ±2%  in  order  to  achieve  a crosstalk  ratio  of  -30  dB  and 
±0.64%  for  -40  dB. 

Case  2:  L = lc(1.3)  =lco(1.3)  and  lc(  1 .55)  = 1C0(1.55)+A1C,  where  Alc 
is  small  compared  to  lco(1.55).  xi  is,  now,  zero  for  L = lc(  1.3). 
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Following  a similar  derivation  procedure  as  used  in  the  previous 
case,  we  obtain  an  expression  for  X2- 


X2  = n 


2(ar 

IrJ 


2 


(6.35) 


The  crosstalk  is  worse  by  a factor  of  four  since  the  coupling 
length  for  1.3  pm  is  twice  that  for  1.55  pm  for  the  same 
interaction  length  thus  imposing  a stricter  requirement  on  R.  In 
this  case,  to  achieve  a crosstalk  ratio  less  than  -30  dB  and  -40 
dB,  the  error  in  R must  be  controlled  within  ±1%  and  ±0.32%, 
respectively. 

Case  3:  R=Rq  and  L = 1C0(1.3)+AL  = 2 1C0(1.55)+AL,  where  AL  is  a 
small  deviation  in  the  interaction  length.  With  similar  derivation 
procedure,  the  crosstalks  at  port  1 and  port  2 are 


7T2/alI2 
4 \L0 ) 

(6.36) 

7X;2  (AL 

V Lo 

(6.37) 

respectively,  where  L0  equals  lCo(1.3)  = 21co(1.55).  Note  that  the 
crosstalk  in  this  case  is  different  from  the  cross  power  ratio 
defined  in  last  subsection;  the  crosstalk  is  the  power  ratio  at  the 
same  output  port  between  two  different  wavelengths  while  the 
cross  power  ratio  is  the  power  ratio  at  the  same  wavelength 
between  two  ports.  Comparison  of  Eqs.  (6.26)  (lc=L0  at  1.3  pm) 
and  (6.27)  (21C=L0  at  1.55  pm)  with  Eqs.  (6.36)  and  (6.37)  shows 
that  cross  power  ratio  is  essentially  the  same  as  the  crosstalk  as 
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long  as  the  normalized  power  transferred  at  a given  wavelength 

approaches  unity. 

6.3.5.  Numerical  Results 

The  fundamental  mode  index  of  the  channels  with  mask  width 
Wm  = 3.5  pm,  calculated  by  the  effective  index  method  with  two  fitting 
parameters  Anmax  and  W,  is  shown  in  Fig.  6-3.  For  this  case,  Anmax  = 
0.0116  and  W = 4.9  pm  are  values  that  best  Fit  the  experimental  data  of 
N as  well  as  the  measured  coupling  length,  which  will  be  presented  in 
section  6.5.  The  two  open  squares  denote  the  measured  mode  indices 
of  the  channel  waveguide  by  the  prism  coupler  technique  using  the 
laser  diode  sources  available  in  our  laboratory.  The  dashed  line 
represents  the  calculated  fundamental  mode  index  as  a function  of 
wavelength.  The  solid  line  represents  the  commercial  BK7  substrate 
index.  The  simulation  data  agree  well  with  the  experimental  results 
for  the  above  mask  width.  The  figure  also  shows  that  the  mode  indices 
N are  much  larger  than  the  substrate  index  and  hence  the 

fundamental  mode  is  well-confined  in  the  wavelength  range  1.3- 1.6 
pm. 

For  different  mask  opening,  W has  different  values  and  increases 
with  increasing  mask  opening  while,  as  expected,  Anmax  remained 
essentially  unchanged.  The  single-mode  wavelength  regime,  for  Wm  = 
3.5  pm,  is  from  1200  nm  to  at  least  1600  nm  since  the  fundamental 
mode  cutoff  wavelength  is  larger  than  our  measurement  limit  (1.6  pm). 
The  horizontal  width  (2W  = 10  pm)  of  the  waveguide  is  approximately 
three  times  larger  than  the  mask  opening  Wm,  indicating  the  validity 
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Fig.  6-3.  The  refractive  index  as  a function  of  wavelength.  The  solid  line  represents  BK7  substrate 
index.  The  dashed  line  represents  the  calculated  fundamental  mode  index  of  channel 
waveguide  with  mask  opening  of  3.5  pm.  The  open  squares  are  the  measured  mode 
indices  of  the  channel  using  the  prism  coupler  technique.  The  waveguide  were  produced 
by  diffusing  for  12  hours  at  350°C. 
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of  the  assumption  of  simicircular  gaussian  index  profile.  Both  Anmax 
and  W (for  a given  Wm)  obtained  from  the  simulation  of  the  channel 
waveguide  are  used  in  the  modeling  of  the  directional  coupler. 

As  described  earlier,  the  normal  mode  indices  of  the  directional 
coupler  are  evaluated  by  the  effective  index  method.  It  is  well  known 
that  the  method  is  not  very  accurate  in  estimating  the  mode  indices, 
especially  in  the  region  close  to  cutoff.  However,  the  coupling  length, 
determined  by  the  normal  mode  approach,  is  found  to  be  sufficiently 
accurate  since  it  is  evaluated  by  the  difference  the  two  normal  mode 
indices.  Hence,  R can  be  evaluated  from  Eq.  (6.29)  once  the  coupling 
lengths  at  two  operating  wavelengths  are  known.  Since  the  ratio  R is 
critical  in  determining  the  performance  of  the  demultiplexer,  to 
arrive  at  the  correct  value  of  R,  several  device  parameters  such  as  the 
interaction  length  L,  the  separation  S,  the  maximum  index  change 
Anmax  and  the  waveguide  size  can  be  adjusted  at  will  so  that  the 
crosstalk  may  be  minimized. 

Figure  6-4  shows  the  ratio  R as  a function  of  the  half-width  (W) 
of  waveguide  for  different  values  of  S in  the  range  11.65  to  12.2  pm 
while  Anmax  is  kept  constant  at  0.0116.  The  curves  shift  downward  as 
the  separation  S decreases.  The  horizontal  dashed  line  marked  as 
1.3/1.55  denotes  the  value  of  R=2.385.  The  intersection  points 
between  the  horizontal  dashed  line  and  the  curves  represent  the  exact 
operating  points  of  the  demultiplexer  at  1.3/1.55  pm.  The  bottom 
curve  with  S = 11.65  pm  has  no  interaction  point,  indicating  that  the 
two-to-one  ratio  of  the  coupling  lengths  at  1.3/1.55  pm  is  not 
attainable.  Therefore,  in  this  case,  we  need  to  consider  cases  with  R 
values  less  than  2.385  (Table  2).  Obviously,  as  the  interaction  length  of 
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Fig.  6-4.  Calculated  R as  a function  of  the  waveguide  half- width  W for  An ,^=0.01 16.  The  dotted  line 
represents  the  ratio  R equal  to  2.385.  The  behavior  of  R is  illustrated  for  varying  values  of  the 
separation  S = 12.2  pm  (long  dashed),  12.0  pm  (solid),  1 1.8  pm  (short  dashed),  and  1 1.65 
pm  (dotted-dashed). 
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the  directional  coupler  becomes  longer,  the  associated  insertion  loss 
increases  while  the  spectral  bandwidth  decreases.  For  the  shortest 
device,  S s 11.65  pm  appears  to  be  the  lower  limit  in  the  case  of 
Anmax  = 0.0116.  As  seen  from  the  figure,  for  a given  S,  there  are  two 
different  intersection  points  and,  therefore,  the  desired  value  of  R is 
obtained  for  two  different  waveguide  sizes.  Either  intersection  point 
can  be  chosen  as  long  as  the  coupling  remains  weak  and  the  individual 
channel  is  single-moded  at  both  wavelengths.  For  a given  Anmax,  at 
small  values  of  W,  the  mode  approaches  cutoff  and  the  coupling 
becomes  stronger.  With  larger  values  of  W,  the  waveguide  may  become 
multimoded.  Increasing  S shifts  the  curve  upward  and  the  separation 
between  the  two  intersection  points  increases.  As  S increases,  the 
intersection  point  on  the  right  hand  side  moves  toward  the  larger 
width  of  the  waveguide,  where  the  channel  may  become  multimoded 
at  shorter  wavelength.  On  the  other  hand,  the  other  intersection  point 
shifts  at  a slower  rate  to  the  left  and  approaches  the  fundamental 
mode  cutoff  at  longer  wavelength,  where  the  coupling  becomes  quite 
strong.  The  simulation  shows  that  the  channel  becomes  multimoded 
for  the  waveguide  half-width  greater  than  5.5  pm  at  1.3  pm  wavelength 
with  Anmax  = 0.0116.  In  the  case  of  S greater  than  12.2  pm,  with  R = 
2.385,  the  waveguide  becomes  either  multimoded  for  large  width 
corresponding  to  the  intersection  point  at  right  or  the  coupling 
becomes  stronger  for  small  width  on  the  other  side.  Hence,  the  upper 
limit  of  S is  12.2  pm.  Thus,  S must  be  within  a restricted  range  of 
11.65  pm  and  12.2  pm  for  Anmax=0.01 16. 

The  magnitude  of  the  slope  in  the  curve  represents  the 
sensitivity  of  the  ratio  R to  the  changes  in  the  waveguide  size.  As  the 
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magnitude  increases,  the  sensitivity  increases,  implying  that  the 
fabrication  tolerance  is  more  restrictive.  For  S = 12  pm  and  a 0.05  pm 
change  in  half-width  W,  the  relative  change  in  R,  i.e.,  AR/Rq,  is  1.1% 

and  0.97%  for  the  intersection  points  on  the  left  and  the  right  hand 
side,  respectively,  corresponding  to  the  crosstalk  ratio  %2  of  -29  dB 

and  -30  dB.  However,  the  fabrication  tolerances  are  not  so  severe,  if 
the  intersection  point  is  exactly  located  at  the  zero-slope  point  with  S 
slightly  greater  than  11.65  pm  and  W equal  to  4.25  pm.  In  the  case  of 
weak  coupling,  note  that  the  zero-slope  point  occurs  almost  at  the 
same  width  for  all  values  of  S within  the  range  indicated,  due  to  the 
lack  of  dependence  of  the  characteristics  of  the  channel  waveguide  on 
the  separation  S.  Also  as  seen  from  Fig.  6-4,  the  magnitude  of  the 
slope  at  the  intersection  point  is  larger  on  the  left  hand  side  than  on 
the  right  hand  side,  indicating  that  the  intersection  point  with  large 
waveguide  width  is  preferable  because  of  the  larger  fabrication 
tolerance. 

Although  for  S = 11.65  pm,  no  intersection  with  R = 2.385  line 
occurs,  the  maximum  index  change  Anmax  can  be  increased  so  that 
the  curve  intersects  the  horizontal  dotted  line,  as  shown  in  Fig.  6-5. 
Again,  R = 2.385  is  satisfied  for  two  waveguide  sizes  with  the  curve 
shifting  upward  with  increasing  Anmax.  For  smaller  separation  S, 
Anmax  needs  to  be  larger  and  since  the  intersection  point  on  the  right 
hand  side  may  cause  the  guide  to  become  multimoded,  the  left 

intersection  point  with  smaller  waveguide  size  should  be  chosen  for 
high  Anmax. 

For  a given  set  of  fabrication  conditions,  and  under  weak 
coupling  conditions,  the  behavior  of  the  individual  channel  waveguides 
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Fig.  6-5.  Calculated  R as  a function  of  the  waveguide  half-width  W for  S=1 1.65  pm  for  various  values 
of  the  maximum  index  change  at  surface  Anniax  = 0.0122  (dotted-dashed),  0.0120  (short 
dashed),  0.01 18  (long  dashed),  and  0.01 16  (solid).  The  horizontal  dotted  line  represents  the 
ratio  R equal  to  2.385. 
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remains  almost  unaffected  with  variation  in  S.  The  value  of  S,  however, 
is  subject  to  the  restrictions  discussed  in  the  previous  paragraphs. 
Consequently,  the  wavelength  range  over  which  the  channel  guide 
remains  single-moded  is  the  same  as  the  range  in  which  two  normal 
modes  exist  in  the  directional  coupler.  Moreover,  the  ratio  R is  almost 
a linear  function  of  S with  its  slope  approximately  equal  to  0.35  pm'1, 
as  shown  in  Fig.  6-6  for  W = 4.9  pm.  This  implies  that  R is  predictable 
if  we  first  make  some  trial  directional  couplers  with  different  gaps  and 
measure  the  coupling  lengths  for  at  least  two  wavelengths.  The  nearly 
linear  relationship  between  R and  S is  extremely  useful  in  designing 
dual  wavelength  demultiplexers. 

Figure  6-7  illustrates  the  coupling  length  versus  wavelength  for 
the  separation  S = 11.8  pm  (long  dashed  line),  12.0  pm  (solid  line), 
and  12.2  pm  (short  dashed  line).  The  coupling  becomes  weaker  and 
the  coupling  length  becomes  longer  as  S increases  or  the  wavelength 
decreases.  The  required  interaction  length  of  the  demultiplexer  can 
be  determined  once  the  two  coupling  lengths  have  been  calculated. 
The  numerical  simulation  provides  useful  guidelines  not  only  in 
designing  the  wavelength  demultiplexer  but  also  in  yielding  the 
required  fabrication  conditions  so  that  the  device  functions  efficiently 
at  the  two  wavelengths,  while  meeting  the  specifications. 

6.4.  Fabrication 

The  device  was  fabricated  using  a procedure  similar  to  that 
presented  in  chapter  4.  The  difference  is  that  an  extra  step  is  needed 
in  the  photolithography  if  only  the  mask  with  well-isolated  channel 
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Fig.  6-6.  Calculated  R versus  the  separation  S for  W = 4.9  pm  with  the  maximum  index  change  at 

surface  as  parameter  - Anniax  = 0.01 14  (long  dashed).  0.01 16  (solid),  and  0.01 18  (short 
dashed).  The  dotted  line  represents  the  ratio  R equal  to  2.385. 
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Fig.  6-7.  Coupling  length  lc  (mm)  versus  wavelength  X (nm).  For  W = 4.9  pm  and  An,^  = 0.01 16, 

the  dashed,  solid,  and  dotted  curves  correspond  to  S = 1 1.8,  12.0,  and  12.2  pm, 
respectively. 
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waveguide  patterns  is  available.  The  photoresist  film  on  the  glass 
substrate  is  prepared  as  described  previously.  After  the  exposure  to 
the  UV  light,  the  sample  is  slightly  separated  from  the  mask  and 
shifted  a distance  S (=12  pm)  horizontally  in  the  direction  normal  to 
the  channels.  Next,  the  sample  is  contacted  to  the  mask  again  and 
exposed  to  the  UV  light.  In  the  second  exposure,  part  of  the  sample  is 
blocked  so  that  the  channel  patterns  created  by  the  second  exposure 
are  shorter  than  those  generated  by  the  first  exposure.  Consequently, 
device  patterns  as  shown  in  Fig.  6-2  can  be  fabricated. 

The  salt  bath  used  is  the  same  as  described  in  chapter  4.  The 
diffusion  temperature  and  time  are  350°C  and  12  hours,  respectively. 
The  resulting  channel  waveguides  support  fairly  well-guided,  single 
modes  at  both  wavelengths,  viz.,  1.3  and  1.55  pm  for  WM  varying  from 
2.5  to  4.0  pm.  For  mask  openings  wider  than  4.0  pm,  a shorter 
diffusion  time  is  required  to  fabricate  single-mode  channel 
waveguides.  As  simulated  in  the  last  chapter,  the  fabrication  conditions 
are  critical.  As  a result,  single-mode  operation  may  not  always  be 
possible  at  both  operating  wavelengths.  Moreover,  if  the  diffusion  time 
is  too  short,  the  device  operates  near  cutoff  at  longer  wavelengths  and 
may  suffer  higher  insertion  loss. 

Since  the  device  performs  very  differently  depending  on  the 
separation,  S must  be  controlled  precisely  — typically,  to  an  accuracy 
better  than  one  tenth  of  micrometer.  As  shown  in  Fig.  6-6,  for  a 
variation  of  0.1  pm  in  S,  AR/R0  is  approximately  1.5%,  corresponding 
to  a X2  of  -27  dB.  In  the  weak  coupling  regime,  the  difference  in  the 
propagation  constants  between  the  two  individual  channels  of  the 
directional  coupler  must  be  as  close  to  zero  as  possible  in  order  to 
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achieve  maximum  power  transfer.  Any  imperfections  in  the  fabrication 
of  the  channels  will  certainly  degrade  the  performance  of  the  devices. 

6.5.  Characterization 

In  almost  all  the  applications  of  the  directional  couplers,  the 
coupling  length  determines  the  performance  of  these  devices. 
Therefore,  the  characterization  of  the  coupling  length  is  of  paramount 
importance.  In  this  section  a novel  technique  [Chen90c]  is  presented 
to  determine  the  coupling  lengths  of  directional  couplers  as  a function 
of  wavelength.  The  coupling  lengths  are  evaluated  from  the  measured 
normalized  power  outputs  based  on  coupled-mode  theory  by  assuming 
that  the  symmetric  couplers  consists  of  two  weakly  coupled 
waveguides.  The  experimental  setup  to  measure  the  outputs  and  the 
procedure  to  estimate  the  coupling  length  are  presented  in  the 
following  subsections. 

6.5.1.  Measurement  of  Spectral  Response 

As  discussed  earlier,  the  outputs  Pi  and  P2  are  wavelength- 
dependent.  They  can  be  measured  by  the  experimental  setup  shown  in 
Fig.  6-8.  A high-power  tungsten  lamp  is  used  in  the  setup  as  a source 
for  the  monochromator.  The  light  from  the  lamp  is  focused  at  the 
entrance  of  the  monochromator  with  a slit  of  ~1  mm  opening.  The 
positions  of  lenses  1 and  2 are  adjusted  until  the  output  from  the 
monochromator  is  maximum  at  the  operating  wavelength.  Hence,  a 
temporary  detector  is  needed  at  output  of  the  monochromator  in 
building  the  setup.  The  output  slit  of  the  monochromator  is  set  to  be 
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Fig.  6-8.  The  experimental  setup  used  to  measure  the  spectral  response  of  the  directional 
couplers.  The  white  light  source  is  used  for  alignment  and  is  removed  during  the 
measurement  of  the  spectral  response  of  the  devices. 
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about  100  [im  determining  the  linewidth  of  the  monochromatic  light. 
Lens  3 is  used  to  collimate  the  output  light  from  the  monochromator, 
and  the  10X  objective  is  used  to  collect  the  light  and  launch  it  into  a 
multimode  fiber.  Since  the  waveguide  is  a single-mode  guide,  only  the 
fundamental  mode  can  be  excited.  The  light  which  is  not  coupled  into 
the  fundamental  mode  radiates  away  and  does  not  affect  the 
propagation  of  the  wave  in  the  coupler.  A fiber  light  source  and  the 
beam  spitter  between  the  lens  3 and  the  objective  are  used  solely  for 
the  purpose  of  aligning  the  directional  coupler  and  are  removed  once 
it  is  aligned.  The  light  from  the  monochromator  or  the  fiber  light 
source  are  butt-coupled  to  the  device  through  a multimode  fiber.  The 
output  from  the  coupler  is  magnified  and  focused  by  a 40X  objective 
and  imaged  onto  a Ge  detector  by  lens  4.  A linear  polarizer  and  a 
chopper  are  inserted  between  the  objective  and  an  aperture  so  that  a 
polarized,  chopped  signal  can  be  obtained.  The  signal  is  amplified  by  a 
preamplifier  and  a lock-in  amplifier  and,  then,  fed  into  a computer. 

In  the  alignment,  the  use  of  a white  light  source  instead  of  a 
visible  laser  is  required  since  at  the  visible  laser  wavelength  very  little 
power  transfer  occurs  and  only  the  straight  through  output  is  seen.  On 
the  other  hand,  the  white  light  passing  through  the  directional 
coupler  with  two  channels  generates  an  image  with  two  spots.  The 
near-field  output  is  focused  by  a 40X  objective  at  the  aperture  which  is 
used  to  choose  one  of  the  output  spots.  It  is  easier  to  isolate  one  spot 
from  the  other  by  the  aperture  only  if  they  are  far  away  from  each 
other,  i.e.,  weakly  coupled.  In  the  case  of  a strongly  coupled  waveguide 
system,  a bend  structure  is  usually  needed  to  separate  the  outputs  for 
accurate  measurement.  The  measurement  is  performed  twice  for  one 
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coupler,  i.e„  one  for  each  channel,  by  varying  the  wavelength.  The 
normalized  power  outputs  is  obtained  by  dividing  the  individual 
channel  output  by  the  total  power  output  from  both  channels. 

Since  the  monochromator  output  is  weak,  a chopper  and  a lock- 
in  amplifier  are  necessary  to  measure  the  signal.  In  the  measurement, 
the  time  constant  of  the  lock-in  amplifier  is  set  to  one  second  which 
is  considered  much  longer  than  the  noise  period.  Since  the  resolution 
of  the  monochromator  is  about  5 nm  in  IR,  the  wavelength  scanning  of 
the  monochromator  should  not  exceed  5 nm/sec  otherwise  the  signal 
may  be  smoothed  out  and  an  accurate  measurement  is  not  achievable. 
We  arbitrarily  choose  the  scanning  speed  of  100  nm/min.  In  this  case 
it  takes  4 minutes  to  scan  from  1200  nm  to  1600  nm. 

Figure  6-9(a)  shows  a typical  normalized  outputs  (P2)  versus 
wavelength  for  the  TE  mode.  The  solid  and  the  dashed  lines  represent 
the  interaction  lengths  Lj  = 16.2  mm  and  L2=13.1  mm,  respectively. 
The  notation  ‘p’  will  be  discussed  in  the  next  subsection.  The  figure 
exhibits  sinusoid -like  curves  indicating  that  complete  power  transfer 
can  be  achieved  at  certain  wavelengths  depending  on  the  interaction 
length.  The  evaluation  of  the  coupling  length  from  the  measured  P2  is 
presented  in  following  section. 

6.5.2.  Determination  of  the  Coupling  Length 

The  coupling  length  lc  can  be  estimated  by  measuring  the 
normalized  power  output  P2  as  a function  of  wavelength  at  port  2.  This 
is  easily  seen  by  rewriting  Eq.  (6.25)  as 
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Fig.  6-9.  Normalized  output  P2  and  coupling  length  vs.  wavelength,  (a) 
P2.  (b)  Coupling  length. 
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\C(X)  = kL/2 

m7c±sirr1VP2(^)  ’ 


(6.38) 


where  m is  an  integer  and  equals  0.  1,  2,...,  etc.  The  selection  of  the 
plus  or  minus  sign  and  m,  which  are  determined  by  the  phase 
difference  of  the  even  and  odd  modes  as  well  as  the  number  of  the 
cycles  of  power  transfer  between  the  guides,  are  discussed  later. 
Conventionally,  at  least  two  measurements  of  P2  for  two  close 
interaction  lengths  for  a given  A,  are  required  to  evaluate  lc.  As  the 
period  of  oscillation  increases  either  with  shorter  coupling  length  or 
with  increasing  asymmetry  as  the  case  may  be,  the  conventional 
approach  will  not  lead  to  meaningful  results.  On  the  other  hand,  Eq. 
(6.38)  and  the  spectral  response  measurement  for  different 
interaction  lengths  of  the  same  directional  coupler  may  give  accurate 
values  of  lc  as  a function  of  wavelength.  The  spectral  response 

technique  described  here  is  quite  useful  in  extracting  the  coupling 
length. 

Figure  6-9(a)  shows  the  normalized  power  output  P2  as  a 
function  of  the  wavelength.  P2  is  maximum  whenever  the  interaction 
length  L (=Li  or  L2)  equals  an  odd  number  of  coupling  lengths.  For  the 
purpose  of  discussion,  these  peaks  are  identified  as  2p+l,  2p+3,  etc. 
at  the  top  of  Fig.  6-9(a)  for  different  wavelengths,  where  p is  a non- 
negative  integer.  P2  is  zero  for  the  interaction  lengths  corresponding 
to  an  even  number  of  coupling  lengths  marked  as  2p,  2p+2,  etc.  at  the 
bottom  of  the  same  figure.  The  peaks  (maxima)  with  the  same  number 
of  coupling  lengths  shift  to  longer  wavelengths  for  devices  with 
shorter  interaction  lengths.  Furthermore,  the  periods,  i.e.,  the 
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wavelength  separation  between  two  adjacent  peaks  increase  as  the 
interaction  lengths  decrease,  which  implies  that  the  sensitivity  to 
wavelength  decreases  for  shorter  directional  couplers.  This  can  be 
easily  seen  in  the  case  of  the  wavelength  demultiplexer  with  the 
shortest  interaction  length  (L=5.4  mm)  operating  at  1.32  and  1.56  pm 
[Chen90b].  For  such  a short  device  incorporating  one  coupling  length 
at  1.32  pm  and  two  coupling  length  at  1.56  pm,  it  is  obvious  the  output 
should  be  somewhat  insensitive.  The  tolerance  in  terms  of  wavelength 
is  discussed  in  detail  in  section  6.4. 

For  the  same  interaction  length,  the  valley  (minimum)  in  Fig.  6- 
9(a)  adjacent  to  a given  peak  at  the  longer  wavelength  always 
corresponds  to  one  more  coupling  length  than  that  at  the  maximum 
since  the  coupling  length  is  shorter  at  longer  wavelength.  However, 
the  number  of  coupling  lengths  at  a given  maximum  or  minimum  can 
not  be  uniquely  determined  with  only  one  measurement  at  a given 
interaction  length;  at  least  two  measurement  with  different 
interaction  lengths  are  required.  We  will  illustrate  the  problem  for  the 
case  shown  in  Fig.  6-9(a).  Equation  (6.38)  has  been  used  to  calculate 
the  coupling  length  from  the  measured  P2  for  a given  interaction 
length,  say  L=Li.  Since  many  wavelengths  can  give  rise  to  the  same  P2, 
it  is  necessary  to  keep  track  of  m as  well  as  the  sign  in  the 
denominator  of  the  right-hand  side  of  Eq.  (6.38).  Furthermore,  we 
note  that  the  coupling  length  lc  must  be  a positive  quantity, 
monotonically  decreasing  with  increasing  wavelength.  Therefore,  lc  is 
largest  for  m=0  and  the  plus  sign  is  chosen.  In  this  case,  lc  decreases 
as  P2  increases  from  nearly  zero  at  lower  wavelength  to  one.  As  the 
wavelength  further  increases,  P2  will  decrease  from  the  peak  value 
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down  to  the  valley,  where  P2=0.  Since  we  require  that  lc  must  be 
continuous  and  decrease  monotonically,  m now  shall  be  set  to  one  and 
the  minus  sign  must  be  chosen  for  lc  in  the  interval  between  L and 
L/2.  1C=L  when  P2  equals  one  and  decreases  to  lc=L/2  when  P2 
becomes  zero.  As  indicated  in  Fig.  6-9(a),  the  minus  sign  is  always 
taken  for  P2  going  from  maximum  to  minimum  as  the  wavelength 
increases  and  the  plus  sign  is  taken  for  P2  going  from  the  valley  to  the 
next  peak  at  the  longer  wavelength,  m is  always  increased  by  one  unit 
when  P2  reaches  the  maximum  as  the  wavelength  increases. 
Consequently,  m equals  2p+l  in  the  region  between  the  peaks 
corresponding  to  lc(M=L/(2p+l)  and  lca)=L/(2p+3).  Thus,  the  value  of 
m remains  the  same  between  two  consecutive  peaks  and  increases  by 
unity  at  each  peak.  The  sign  in  Eq.  (6.38),  on  the  other  hand,  is 
negative  as  P2  decreases  and  is  positive  as  P2  increases.  In  this 
manner,  with  careful  bookkeeping,  lc  can  be  calculated  from  the 
measured  P2  by  using  Eq.  (6.38). 

Figure  6-9(b)  shows  the  computed  coupling  lengths  from  the 
measured  values  of  P2  as  a function  of  the  wavelength  assuming  various 
values  for  p,  viz.,  p=0,  1,  2,  and  3.  The  solid  lines  represent  the  case 
when  the  interaction  length  L=Li.  By  repeating  the  procedure,  lc 
versus  X is  evaluated  for  the  case  L=L2  and  is  represented  by  the 
dashed  lines  for  p=0,  1,  2,  and  3.  The  solid  curve  coincides  with  the 
dashed  curve  for  p=l  indicating  the  correct  value  of  p since  the 
coupling  length  must  be  independent  of  the  interaction  length  L.  The 
discontinuities  occurring  at  the  peaks  or  the  valleys  of  P2,  highlighted 

as  circles,  are  introduced  by  the  limited  signal-to-noise  ratio  available 
in  the  experiment  since  an  incoherent  light  source  was  used.  For 
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asymmetric  directional  couplers,  Eq.  (6.25)  is  not  applicable 
[Chen90c]  since  the  power  can  not  completely  transfer  to  waveguide 
2.  In  this  case,  the  normalized  powers  in  the  individual  waveguides 
can  be  derived  from  Eq.  (6.4)  as 

P i = = cos2(Kz)  + jp-  sin2(Kz) 

(6.39) 

Pa=^=l fsin2(KZ). 

where  K = "\j 82+k12k21  and  5 = (P2-P i)/2.  8 is  zero  for  the  symmetric 
case  and  the  above  equation  reduces  to  Eq.  (6.7).  When  the  coupling 
between  the  two  waveguides  becomes  stronger,  the  scalar  coupled - 
mode  theory  breaks  down  and  Eq.  (6.39)  is  not  applicable. 

6-6-  Experimental  Results  and  Comparison  with  Modeling 

Utilizing  the  technique  described  above,  we  measure  the 
normalized  power  outputs  Pi  and  P2,  which  are  used  to  evaluate  the 
coupling  length  lc  as  a function  of  wavelength.  Figures  6-10  (a)  and  (b) 
show  the  coupling  length  versus  wavelength  for  the  mask  opening  WM 
= 3.0  pm  and  3.5  pm,  respectively.  The  discrete  points  denote  the 
simulation  results  and  the  solid  lines  represent  the  coupling  lengths 
calculated  from  the  experimental  measurement.  As  expected,  the 
coupling  length  for  the  TE  mode  is  slightly  larger  than  that  for  the  TM 
mode  due  to  better  confinement  of  the  modal  field.  The  reason  being, 
although  the  waveguide  is  almost  homogeneous  and  isotropic,  the  TM 
mode,  experiences  a slightly  lower  propagation  constant,  has  a weaker 
confinement  and  hence,  a smaller  coupling  length.  Because  the 
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Fig.  6-10.  Estimated  and  measured  coupling  length  (mm)  for  TE  and  TM  modes  vs.  wavelength 
(nm).  The  simulation  results  are  represented  by  discrete  points,  where  (a)  W = 4.6  pm 
for  WM=  3.0  pm.  The  experimental  measurements  are  represented  by  the  solid  lines. 
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Fig.  6-10  - continued,  (b)  W - 5.0  qm  for  WM  = 3.5  qm. 
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channel  waveguide  becomes  multimoded  around  1.2  pm  wavelength, 
the  modeling  is  inapplicable  at  wavelengths  below  1.2  pm  since  the 
directional  coupler  now  has  more  than  two  normal  modes. 

The  devices  were  designed  to  operate  at  1.315/1.561  pm 
wavelength.  For  the  TE  mode,  the  simulation  shows  an  R equal  to 
2.352  for  the  demultiplexer  operating  at  these  two  wavelengths.  The 
experimental  data  show  that  R is  equal  to  2.378  for  the  same 
wavelengths,  in  the  case  of  Wm=3.5  pm.  The  coupling  lengths  for  the 
TE  mode  are  5.37  mm  and  2.68  mm  at  1.315  pm  and  1.561  pm 
wavelengths,  respectively.  However,  the  device  is  not  optimized  for 
TM  mode  and  the  ratio  R equals  2.338  for  the  two  wavelengths  in  our 
experiment.  In  the  simulation,  for  the  mask  openings  WM=  3.0  and  3.5 
pm,  the  waveguide  half-width  W are  4.58  and  4.97  pm,  respectively, 
and  the  maximum  index  change  Anmax  for  TM  mode  is  about  1-2X1  O'4 
lower  than  for  TE  mode.  This  is  believed  to  be  caused  by  a small 
birefringence  in  the  Ag+-Na+  exchanged  channel  waveguide. 

Figures  6-11  (a)  and  (b)  show  the  normalized  power  output  at 
port  2 as  a function  of  wavelength  with  L=  5.37  mm  for  WM=  3.0  pm 
and  3.5  pm,  respectively.  The  simulation  results  are  respectively 
represented  by  the  closed  and  open  circles  for  the  TE  and  TM  modes, 
and  the  experimental  data  are  denoted  by  the  solid  and  dashed  lines 
for  the  TE  and  TM  modes,  respectively.  From  the  experiment,  it  is 
seen  that  both  the  cross  power  ratio  £ at  1.561  pm  and  the  crosstalk 
Xi  at  port  1 are  -32  dB.  Both  the  cross  power  ratio  £ at  1.315  pm  and 
the  crosstalk  %2  at  port  2 are  -30  dB,  which  are  not  as  good  as  those  at 
port  1 because  the  interaction  length  L is  slightly  different  from  the 
coupling  length  lc(1.3  pm).  The  curve  for  the  TM  mode  shifts  about 
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Wavelength  (nm) 

Fig.  6-11.  The  normalized  power  output  P2  at  port  2 vs.  wavelength  (nm)  for  (a)  Wm=  3.0  pm. 

The  simulation  results  are  represented  by  closed  circles  (TE  mode)  and  open  circles 
(TM  mode)  and  the  experimental  measurements  by  the  solid  (TE  mode)  and  the 
dashed  (TM  mode)  lines.  The  interaction  length  is  5.37  mm. 
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Fig.  6-1 1 - continued,  (b)  WM=  3.5  (im. 
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16  nm  toward  shorter  wavelengths  compared  to  that  of  the  TE  mode 
since  the  TM  mode  exhibits  a shorter  coupling  length.  A small  hump 
at  1.39  pm  wavelength  is  caused  by  the  existence  of  a water  absorption 
peak  in  the  fiber  used  in  the  measurement  system.  Furthermore,  the 
behavior  of  the  coupling  length  in  Figs.  6- 10(a)  and  (b)  and  the  power 
output  in  Figs.  6- 11  (a)  and  (b)  as  functions  of  wavelength  are  similar 
for  both  cases,  viz.,  Wm=  3.0  pm  and  3.5  pm.  This  indicates  that  the 
coupling  between  the  guides  of  the  directional  coupler  as  well  as  the 
sizes  of  modal  fields  in  the  channel  waveguides  are  somewhat 
insensitive  to  the  mask  openings  in  this  range.  The  modeling  results 
are  in  excellent  agreement  with  the  experimental  results  for  both  the 
coupling  length  and  the  normalized  power  output,  as  illustrated  in 
Figs.  6-10  and  6-11.  The  devices  described  above  were  fabricated  and 
tested  to  verify  the  accuracy  of  the  model.  We  have  also  achieved  a 
demultiplexer  device,  where  the  shortest  intersection  length  (L=  5.37 
mm)  demultiplxer  was  optimized  to  operate  at  1.315/1.561  pm,  with 
insertion  loss  of  1.25  dB  for  a total  device  length  14.5  mm  and 
propagation  loss  of  0.15  dB/cm  for  the  mask  opening  of  3.5  pm  with  a 
cross  power  ratio  and  crosstalk  of  less  than  -40  dB.  Figure  6-12  shows 
the  near-field  intensity  profile  in  the  horizontal  direction  (y-axis)  at 
the  output  with  Wm  = 3.0  pm,  measured  by  the  near-field  setup  shown 
in  Fig.  4-9.  A 1.315  pm  wavelength  laser  diode  was  used  to  excite  the 
demultiplexer  at  the  input  end.  The  mask  opening  regions  are 
indicated  by  two  rectangulars  with  mark  1 as  waveguide  1 and  mark  2 
as  waveguide  2.  It  is  obviously  that  the  energy  is  confined  in  the 
waveguide  2.  The  cross  power  ratio  £ at  portl  was  measured  to  be  less 
than  -40  dB  at  this  wavelength. 


Cross  power  ratio  < -40  dB 
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ig.  6-12.  Output  scan  of  the  near  field  intensity  profile  across  the  output  ports 
1 and  2 in  the  plane  of  the  substrate  at  1.315  pm  wavelength. 
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To  check  the  validity  of  the  modeling,  devices  with  L equal  to 
three  times  of  lc  ( 1 . 3 1 5 gm)  and  six  times  of  lc(  1.561  gm)  were  also 
fabricated.  As  shown  in  Fig.  6-13,  again  the  simulation  agrees  very  well 
with  the  experimental  results  for  (a)  WM=  3.0  gm  and  (b)  WM=  3.5  gm. 
The  peaks  or  maxima  correspond  to  an  odd  number  of  coupling 
lengths  with  complete  power  transfer  to  port  2.  The  valleys  or 
minima,  on  the  other  hand,  correspond  to  an  even  number  of  coupling 
lengths  and  a complete  transfer  of  power  at  port  1.  As  indicated 
earlier,  the  fabrication  conditions  are  more  critical  for  these  longer 
devices  since  the  spectral  bandwidth  2AX  corresponding  to  -20  dB 
point  is  smaller  than  that  of  the  devices  shown  in  the  Figs.  6- 11  (a)  and 
(b).  The  sensitivity  to  the  polarization  also  increases,  which  is 
undesirable  in  coherent  optical  communication  applications.  In 
addition,  the  total  insertion  loss  increases  due  to  the  increase  in  the 
length  of  the  devices. 


6.7.  Summary 

In  this  chapter,  we  have  discussed  the  scalar  coupled-mode 
theory.  Although  the  application  of  the  theory  is  limited  to  the  case  of 
weak  coupling,  it  is  still  useful  in  predicting  the  power  transfer  in  the 
directional  couplers  since  the  devices  are  weakly  coupled.  The 
numerical  analysis  was  dealt  with  using  the  normal-mode  approach  so 

that  the  coupling  coefficient  can  be  estimated  from  the  normal  mode 
indices. 

A thorough  model  was  presented  for  the  1.32/1.56  dual 
wavelength  demultiplexer.  The  modeling  was  used  to  predict  the 
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coupling  lengths  of  the  devices  and  the  normalized  power  outputs. 
The  fabrication  parameters  were  optimized  by  the  simulation  so  that 
the  demultiplxers  were  successfully  fabricated.  The  crosstalk  and 
cross  power  ratio  were  defined  in  terms  of  the  fabrication  deviation 
AL,  AR  or  Alc.  The  crosstalk  and  cross  power  ratio  of  the 
demultiplexers  were  measured  to  be  less  than  -30  dB  at  the  operating 
wavelengths  of  1.315  and  1.561  pm.  The  experimental  setup  to 
measure  the  spectral  response  and  a novel  procedure  to  evaluate  the 
coupling  lengths  of  the  directional  couplers  were  also  addressed  in 
this  chapter. 


CHAPTER  7 

CONCLUSIONS  AND  FUTURE  WORK 
7.1.  Conclusions 

The  objective  of  this  project  was  to  perform  a general, 
systematic,  theoretical  and  experimental  study  of  the  ion-exchange 
process  in  glass.  This  research  work  includes  the  modeling 
characterization  of  1-D  and  2-D  diffusion  and  the  resulting  graded- 
index  waveguides  as  well  as  the  application  of  these  waveguides  to  two 
practical  devices:  tapered  waveguide  transitions  and  wavelength 
MUX/DEMUX  using  symmetrical  directional  couplers.  Excellent 
agreement  of  the  modeling  with  the  experimental  results  has  been 
obtained. 

The  following  is  a list  of  accomplishments  resulting  from  this 
research  work: 

1.  Fabrication  of  Ag+-Na+  exchanged  waveguides  with  mask 
material  consisting  of  an  AI2O3  thin  film  sandwiched  between 

the  top  aluminum  layer  and  the  substrate.  Single-mode,  low- 
loss  waveguide  devices  in  BK7  glass  with  propagation  loss  as 
low  as  0.15  dB/cm  were  obtained. 

2.  Numerical  solutions  of  single-step  and  two-step  diffusion  with 
and  without  an  externally  applied  field  by  the  Green’s  function 
method.  Conditions  to  achieve  symmetrical,  buried  waveguides 
were  obtained  in  the  simulation. 
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3.  Determination  of  the  silver  self-diffusion  coefficient  in  BK7 
glass  as  a function  of  temperature  by  the  inverse  WKB  method 
from  the  prism-coupler  measurement  and  comparison  to  the 
simulation  using  the  finite  difference  method. 

4.  Systematic  modeling  of  planar,  channel,  and  postbaking 
tapered  waveguides.  The  modeling  results  are  in  excellent 
agreement  with  experimental  measurements  of  the  index  and 
near -field  profiles. 

5.  Modeling  of  radiation  loss  of  tapered  waveguides  by  the  beam 
propagation  method  for  various  taper  functions.  Lossless  tapers 
are  achievable  for  taper  lengths  of  more  than  2 mm. 

6.  Thorough  simulation  of  symmetrical  directional  couplers  as 
dual  wavelength  MUX/DEMUX.  The  spectral  bandwidth, 
crosstalk,  and  cross  power  ratio  were  derived  in  terms  of  the 
fabrication  parameters. 

7.  Demonstration  of  a novel  technique  to  determine  the  coupling 
lengths  of  weakly-coupled  symmetrical  directional  couplers  as 
a function  of  wavelength  by  the  spectral  response. 

8.  Experimental  fabrication  of  1.32/1.56  pm  wavelength  MUX/ 
DEMUX  with  crosstalk  and  cross  power  ratio  less  than  -40  dB. 

The  results  obtained  in  this  work  demonstrate  that  ion- 
exchanged  waveguides  and  devices  can  be  fabricated  and  modeled  to 
determine  the  processing  parameters  a priori  for  specific  device 
performance.  Although  most  of  the  work  was  performed  using  the 
silver-sodium  cation  pair  and  BK7  glass,  the  theory  and  numerical 
method  presented  here  are  adequate  to  treat  other  cation  pairs  and 
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substrate  glasses.  In  the  following  sections,  we  outline  the  future 
prospects  of  the  ion-exchange  technique  and  point  out  some  of  the 
research  issues  which  remain  to  be  addressed  for  future  exploitation 
of  the  process  in  designing  novel  devices. 

The  scope  for  future  work  based  on  the  above  conclusions  is 
described  in  the  following.  The  research  to  be  pursued  in  the  future 
essentially  would  have  two  directions:  1)  advanced  study  of  the  ion- 
exchange  process,  2)  research  on  the  functional  devices  and 
integration  of  waveguide  devices.  These  two  topics  will  be  briefly 
discussed  in  the  following  sections.  The  goal  of  the  research  on  the 
ion  exchange  and  the  devices  is  to  apply  the  results  to  a practical 
optical  communication  system. 

7.2.  Advanced  Study  on  Ion  Exchange 

Since  the  ion  exchange  process  is  an  essential  part  of  modeling 
of  all  the  ion-exchanged  devices,  the  knowledge  gained  from  the 
advance  study  can  help  set  up  a more  versatile  modeling  which  is 
applicable  to  most  of  cases.  The  prospective  study  includes  the 
modeling  for  other  cation  pairs,  space-charge  effect,  local  field,  side 
diffusion,  electromigration,  and  thorough  analysis  of  2-D  buried 
waveguides  etc.  They  will  be  discussed  in  the  following  subsections. 

7.2.1.  Other  Cation  Pairs 

Ag+-Na+  exchange  and  the  devices  using  this  cation  pair  have 
been  concentrated  in  this  research  work.  Other  ion  pairs,  e.g.,  K+-Na+, 
Tl+-K+  etc.  for  different  glasses,  are  considered  to  be  very  useful. 
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Several  reports  have  been  published  dealing  with  the  case  of  K+-Na+ 
exchanged  waveguides  [see  reference  list  in  section  2.3].  It  is  well 
known  that  the  K+-Na+  exchange  creates  birefringence  of  the 
waveguides  due  to  stress-induced  index  change  [Yip85].  This  feature 
can  be  employed  to  fabricate  polarization-dependent  devices  such  as 
TE/TM  splitters.  Besides,  the  index  change  is  quite  small  and, 
therefore,  it  is  useful  for  fabricating  single-mode  devices.  As  a matter 
of  fact,  high-performance  waveguides  fabricated  by  K+-Na+  have  been 
realized  in  this  laboratory  [Mili89].  However,  a detailed  modeling  of 
the  2-D  ion  exchange  in  this  case  still  remain  to  be  performed. 

Low-loss  waveguides  fabricated  by  thallium  ion  exchange  have 
been  demonstrated  since  1972  [Izaw72].  Tl+  has  very  high 
polarizability  compared  to  Na+  or  K+  and,  therefore,  it  produces  quite 
high  index  change  (~0.1)  when  exchanged  with  Na+  or  K+  ions. 
Furthermore,  the  loss  of  the  thallium  exchanged  guides  is  very  small. 
However,  Tl+  is  highly  toxic  and  a good  ventilation  system  is  required 
during  the  fabrication.  Limited  results  have  been  reported  on  this 

cation  system  in  the  past  and  extensive  study  still  needs  to  be  carried 
out. 

The  self-diffusion  coefficients  for  different  exchanging  ions  are 
different  at  the  same  temperature.  Novel  devices,  fabricated  by  various 
ion  pairs  with  the  small-diffusion-coefficient  ions  exchanging  first  at 
high  temperature  followed  by  the  second  ion  exchange  at  lower 
temperature,  may  take  advantage  of  each  ion-pair  and  fulfill  the 
requirements  for  the  realization  of  new  devices.  In  this  case,  a 
systematic  study  and  modeling  are  necessary  for  the  multi-ion 
exchanged,  guided-wave  devices. 
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7.2.2.  Space  Charge.  Local  Field,  and  Side  Diffusion 

In  chapter  2,  the  nonlinear  diffusion  equation  was  derived  from 
the  Fick’s  law  with  the  assumption  of  almost  negligible  space  charge 
density.  However,  for  the  case  of  aNA  ~ 1,  the  space  charge  may  not  be 
omitted.  The  effect  causes  steeper  index  profiles  [Neum79]. 
Therefore,  the  electric  field  induced  by  the  space  charge  must  be 
considered  in  the  modeling  so  that  a more  accurate  index  profile  can 
be  predicted. 

It  is  believed  [Char80,  Walk83b]  that  if  a metallic  mask  is  used,  a 
local  field  from  the  mask  opening  region  to  the  region  under  the  mask 
can  be  generated  by  the  electrochemical  potential  differences  between 
the  interface  of  mask-glass  and  that  of  melt-glass.  Usually  the  local 
field  induced  by  the  mask  material  may  speed  up  the  horizontal 
diffusion  and,  hence,  the  side  diffusion  is  larger  than  the  vertical 
diffusion,  resulting  in  the  waveguide  width  being  much  larger  than  the 
waveguide  depth.  In  this  case,  the  fabrication  conditions  for  single- 
mode waveguide  are  more  critical,  as  mentioned  in  chapter  5.  To  solve 
the  problem,  dielectric  mask  materials  and  guard-band  openings  may 
be  used  to  help  reduce  the  local  field  induced  by  the  mask. 

Eletromigration  has  been  intensively  investigated  and  reported 
[Abou79,  Coop80,  Lili82,  Yosh85,  Kane85,  Kane86,  Forr86].  Under  an 
external  applied  field,  the  space  charge  density  becomes  even  larger 
than  the  case  without  the  field.  Recently,  Albert  and  Lit  [Albe90]  have 
demonstrated  the  modeling  of  field-assisted  ion  exchange.  However, 
they  assume  that  the  total  ion  concentrations  equal  the  initial 
concentration  of  the  host  ions,  i.e.,  CA  + CB  = CB0,  which  is  not  true 
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when  the  space  charge  is  not  negligible,  especially,  under  an  external 
field.  Further  study  on  the  space  charge  effect  is  needed. 

7.2.3.  2-D  Buried  Waveguides 

The  buried  waveguides  are  known  to  be  immune  to  surface 
roughness  and  their  scattering  loss  is  lower  than  surface  waveguides. 
In  addition,  a symmetric,  fiber-compatible  mode-field  profile  is 
achievable.  Mode  mismatch  loss  is  reduced  when  the  waveguide  is 
embedded  into  the  substrate. 

The  simulation  of  1-D  buried  waveguides  was  presented  in 
chapter  3.  The  buried  waveguides  are  usually  obtained  by  a two-step 
process.  The  modeling  for  the  2-D  buried  waveguides  needs  to  be 
performed  since  it  is  of  paramount  importance  in  designing  fiber- 
compatible  modal  fields. 

7-3.  Functional  Devices  and  Integrated  System 

In  this  section,  some  useful  passive  devices  are  proposed  to 
achieve  special  functions  in  optical  communication  systems,  e.g., 
coupling  to  fibers,  wide-band  MUX/DEMUX,  narrow-band  filters,  etc. 
Furthermore,  an  integration  of  these  devices  is  also  an  important  issue 
to  be  addressed. 


7.3.1.  Waveguide  Bends 


The  directional  couplers  discussed  in  chapter  6 are  constructed 
by  two  straight  channels  without  waveguide  bends.  As  a result, 


fteMWaVegUiduS  Can  3180  be  obtained  by  metallic  silver  film  diffusion 
with  applied  field  using  molten  salt  bath  as  the  electrodes  [Pant87,  Gunt88,  Zhen88], 
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difficulties  occur  in  separately  coupling  into  fibers  since  the  diameter 
of  the  fiber  is  about  125  pm,  which  implies  that  the  separation  of  the 
two  guides  at  the  output  should  be  at  least  125  pm.  As  shown  in  Fig.  7- 
1,  waveguide  bends  can  be  added  at  both  the  input  and  output  ends  for 
coupling  purpose.  They  are  necessary  for  practical  applications. 
However,  an  analysis  of  bent  waveguides  as  well  as  the  variation  in  the 
power  outputs  of  the  coupler  with  bends  was  not  carried  out  in  this 
work.  The  modeling  of  the  bends  in  practical  directional  couplers  can 
be  simulated  by  BPM.  The  loss  caused  by  the  bending  can  also  be 
evaluated  by  this  method.  To  minimize  the  radiation  loss  and  to 
increase  the  packing  density  of  devices,  gradual  index  tapers  along 
bends  with  the  maximum  index  change  occurring  in  the  shapest 
region  would  be  helpful  in  designing  smaller  radii  bends.  Mechnisms 
to  incorporate  such  tapered  bends  need  to  be  investigated. 

7.3.2.  Asymmetric  Y-Branch 

As  discussed  in  chapter  6,  a MUX/DEMUX  using  symmetric 
directional  couplers  is  sensitive  to  the  variation  of  the  interaction 
length,  as  shown  in  Eq.  (6.26),  (6.27), (6.36),  and  (6.37).  If  AL  is  not 
zero,  then  the  crosstalk  as  well  as  the  cross  power  ratio  will 
deteriorate  significantly.  An  asymmetric  Y-branch,  considered  to  be 
immune  to  the  length  variation,  is  proposed  by  Negami  et  al.  [Nega89] 
and  realized  by  Goto  and  Yip  [Goto90]  using  ion  exchange  technique. 
The  schematic  diagram  is  shown  in  Figs.  7-2(a)  and  (b).  The  two 
branches  have  different  index  profiles.  The  angle  0 of  the  Y-branch 
must  be  small  enough  (of  the  order  of  milli-radians)  to  satisfy  the 


234 


Fig.  7-1.  Directional  coupler  with  bends  for  coupling  with  fibers. 
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Fig.  7-2.  Asymmetrical  Y-branch  for  wide  band  MUX/DEMUX. 
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adiabatic  condition  so  that  no  mode  conversion  occurs  in  the  device. 
In  the  wavelength  region  longer  than  the  specified  wavelength  at 
which  the  phase  matching  condition  is  met,  one  branch  is  assumed  to 
have  a larger  propagation  constant  than  the  other  so  that  the 
fundamental  mode  of  the  input  guide  will  travel  along  this  branch.  On 
the  other  hand,  in  the  shorter  wavelength  region,  if  the  other  branch 
has  a larger  propagation  constant,  then  the  mode  will  travel  along  this 
branch.  Hence,  the  function  of  MUX/DEMUX  is  attainable.  In 
principle,  this  configuration  has  a wider  bandwidth  than  the 
symmetric  directional  coupler.  Although  the  crosstalk  is  predicted 
theoretically  to  be  -20  dB,  the  reported  crosstalk  is  measured  to  be 
between  -5  dB  and  -6  dB,  illustrating  that  some  improvements  still 
can  be  made  to  obtain  better  performance.  For  example,  one  can 
utilize  both  the  material  as  well  as  waveguide  dispersion. 

7.3.3.  Asymmetric  Directional  Coupler 

As  the  density  of  channels  in  optical  communication  becomes 
higher,  narrower  bandwidth  filters  for  multiplexing  and/or 
demultiplexing  the  signals  in  frequency  domain  are  likely  to  become 
essential  components  in  the  system.  The  asymmetric  directional 
coupler  is  designed  such  that  phase  matching  is  satisfied  at  only  one 
wavelength  where  the  dispersion  curves  of  these  two  guides  intersect. 
At  this  wavelength,  if  the  interaction  length  of  the  coupler  exactly 
equals  an  odd  number  of  coupling  lengths,  then  the  power  can 
completely  transfer  to  the  second  waveguide.  At  the  other 
wavelengths,  complete  power  transfer  is  not  possible  since  the  phase 
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matching  condition  is  not  met.  Such  a device  has  been  demonstrated 
in  LiNb03  by  Alferness  [Alfe78];  the  bandwidth  of  this  device  is  very 
narrow  so  that  it  is  suitable  for  using  as  MUX/DEMUX  in  high-density 
communication  system.  The  realization  of  this  device  in  ion-exchanged 
waveguides  would  be  similar  to  the  asymmetric  Y-branch.  The 
difference  is  that  the  two  guides  are  parallel  to  each  other  in  this  case. 

7.3.4.  Glass  Waveguide  Laser  and  Integrated  System 

Neodymium  doped  glass  lasers  were  developed  more  than  two 
decades  ago.  However,  laser  action  in  ion-exchanged  waveguides  has 
been  demonstrated  in  Nd:glass  only  recently  [Sanf90,  Aoki90].  Their 
miniature  structure  and  their  potential  to  integrate  with  other  ion- 
exchanged,  passive  devices  makes  them  attractive.  Sanford  et  al 
demonstrated  the  fabrication  by  K+-Na+  exchange,  while  Aoki  et  al 
used  metallic  silver  film.  In  both  cases,  external  fields  are  employed. 
Lasing  wavelengths  are  1.054  and  1.057  nm,  respectively.  However, 
the  integration  of  active  and  passive  devices  is  yet  to  be  reported. 

The  integrated  system  consisting  of  light  sources  and  passive 
devices  can  be  applied  to  not  only  optical  communication  but  also 
optical  sensing.  By  using  ion  exchange  in  glass,  the  fabrication  of  the 
integrated  system  is  easy  and  the  cost  of  processing  is  reduced. 
Further  research  work  needs  to  be  performed  to  achieve  such  an 
integrated  system. 
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